计算生物学
药物发现
蛋白质组
计算机科学
药品
人工智能
模式识别(心理学)
生物信息学
生物
药理学
作者
Xiangxiang Zeng,Hongxin Xiang,Linhui Yu,Jianmin Wang,Kenli Li,Ruth Nussinov,Feixiong Cheng
标识
DOI:10.1038/s42256-022-00557-6
摘要
The clinical efficacy and safety of a drug is determined by its molecular properties and targets in humans. However, proteome-wide evaluation of all compounds in humans, or even animal models, is challenging. In this study, we present an unsupervised pretraining deep learning framework, named ImageMol, pretrained on 10 million unlabelled drug-like, bioactive molecules, to predict molecular targets of candidate compounds. The ImageMol framework is designed to pretrain chemical representations from unlabelled molecular images on the basis of local and global structural characteristics of molecules from pixels. We demonstrate high performance of ImageMol in evaluation of molecular properties (that is, the drug’s metabolism, brain penetration and toxicity) and molecular target profiles (that is, beta-secretase enzyme and kinases) across 51 benchmark datasets. ImageMol shows high accuracy in identifying anti-SARS-CoV-2 molecules across 13 high-throughput experimental datasets from the National Center for Advancing Translational Sciences. Via ImageMol, we identified candidate clinical 3C-like protease inhibitors for potential treatment of COVID-19. Predicting the properties of a molecule from its structure with high accuracy is a crucial problem in digital drug design. Instead of sequence features, Zeng and colleagues use an image representation of a large collection of bioactive molecules to pretrain a model that can be fine-tuned on specific property prediction tasks.
科研通智能强力驱动
Strongly Powered by AbleSci AI