化学
药物发现
口译(哲学)
鉴定(生物学)
计算生物学
DNA
结合亲和力
钥匙(锁)
机器学习
过程(计算)
DNA测序
选择(遗传算法)
人工智能
数据挖掘
数据科学
程序设计语言
生物化学
计算机科学
生物
植物
受体
计算机安全
操作系统
作者
Moreno Wichert,Laura Guasch,Raphael M. Franzini
出处
期刊:Chemical Reviews
[American Chemical Society]
日期:2024-11-07
卷期号:124 (22): 12551-12572
标识
DOI:10.1021/acs.chemrev.4c00284
摘要
DNA-encoded library (DEL) technology is a powerful platform for the efficient identification of novel chemical matter in the early drug discovery process enabled by parallel screening of vast libraries of encoded small molecules through affinity selection and deep sequencing. While DEL selections provide rich data sets for computational drug discovery, the underlying technical factors influencing DEL data remain incompletely understood. This review systematically examines the key parameters affecting the chemical information in DEL data and their impact on hit triaging and machine learning integration. The need for rigorous data handling and interpretation is emphasized, with standardized methods being critical for the success of DEL-based approaches. Major challenges include the relationship between sequence counts and binding affinities, frequent hitters, and the influence of factors such as inhomogeneous library composition, DNA damage, and linkers on binding modes. Experimental artifacts, such as those caused by protein immobilization and screening matrix effects, further complicate data interpretation. Recent advancements in using machine learning to denoise DEL data and predict drug candidates are highlighted. This review offers practical guidance on adopting best practices for integrating robust methodologies, comprehensive data analysis, and computational tools to improve the accuracy and efficacy of DEL-driven hit discovery.
科研通智能强力驱动
Strongly Powered by AbleSci AI