化学
药物发现
启发式
合成数据
计算机科学
生成模型
集合(抽象数据类型)
合成生物学
渡线
算法
生成语法
遗传算法
人工智能
机器学习
计算生物学
生物
生物化学
程序设计语言
作者
Jike Wang,Xiaorui Wang,Huiyong Sun,Mingyang Wang,Yiyu Zeng,Dejun Jiang,Zhenxing Wu,Zeyi Liu,Ben Liao,Xiaojun Yao,Chang‐Yu Hsieh,Dongsheng Cao,Xi Chen,Tingjun Hou
标识
DOI:10.1021/acs.jmedchem.2c01179
摘要
Many deep learning (DL)-based molecular generative models have been proposed to design novel molecules. These models may perform well on benchmarks, but they usually do not take real-world constraints into account, such as available training data set, synthetic accessibility, and scaffold diversity in drug discovery. In this study, a new algorithm, ChemistGA, was proposed by combining the traditional heuristic algorithm with DL, in which the crossover of the traditional genetic algorithm (GA) was redefined by DL in conjunction with GA, and an innovative backcrossing operation was implemented to generate desired molecules. Our results clearly show that ChemistGA not only retains the strength of the traditional GA but also greatly enhances the synthetic accessibility and success rate of the generated molecules with desired properties. Calculations on the two benchmarks illustrate that ChemistGA achieves impressive performance among the state-of-the-art baselines, and it opens a new avenue for the application of generative models to real-world drug discovery scenarios.
科研通智能强力驱动
Strongly Powered by AbleSci AI