化学
口译(哲学)
计算生物学
DNA
数据科学
纳米技术
程序设计语言
生物化学
计算机科学
材料科学
生物
作者
Moreno Wichert,Laura Guasch,Raphael M. Franzini
标识
DOI:10.1021/acs.chemrev.4c00284
摘要
DNA-encoded library (DEL) technology is a powerful platform for the efficient identification of novel chemical matter in the early drug discovery process enabled by parallel screening of vast libraries of encoded small molecules through affinity selection and deep sequencing. While DEL selections provide rich data sets for computational drug discovery, the underlying technical factors influencing DEL data remain incompletely understood. This review systematically examines the key parameters affecting the chemical information in DEL data and their impact on hit triaging and machine learning integration. The need for rigorous data handling and interpretation is emphasized, with standardized methods being critical for the success of DEL-based approaches. Major challenges include the relationship between sequence counts and binding affinities, frequent hitters, and the influence of factors such as inhomogeneous library composition, DNA damage, and linkers on binding modes. Experimental artifacts, such as those caused by protein immobilization and screening matrix effects, further complicate data interpretation. Recent advancements in using machine learning to denoise DEL data and predict drug candidates are highlighted. This review offers practical guidance on adopting best practices for integrating robust methodologies, comprehensive data analysis, and computational tools to improve the accuracy and efficacy of DEL-driven hit discovery.
科研通智能强力驱动
Strongly Powered by AbleSci AI