Distinguishing drug/non-drug-like small molecules in drug discovery using deep belief network

药品 概化理论 药物发现 公共化学 计算机科学 药物开发 人工智能 机器学习 批准的药物 药理学 数据挖掘 医学 计算生物学 生物信息学 数学 统计 生物
作者
Seyed Aghil Hooshmand,Sadegh Azimzadeh Jamalkandi,Seyed Mehdi Alavi,Ali Masoudi‐Nejad
出处
期刊:Molecular Diversity [Springer Science+Business Media]
卷期号:25 (2): 827-838 被引量:20
标识
DOI:10.1007/s11030-020-10065-7
摘要

The advent of computational methods for efficient prediction of the druglikeness of small molecules and their ever-burgeoning applications in the fields of medicinal chemistry and drug industries have been a profound scientific development, since only a few amounts of the small molecule libraries were identified as approvable drugs. In this study, a deep belief network was utilized to construct a druglikeness classification model. For this purpose, small molecules and approved drugs from the ZINC database were selected for the unsupervised pre-training step and supervised training step. Various binary fingerprints such as Macc 166 bit, PubChem 881 bit, and Morgan 2048 bit as data features were investigated. The report revealed that using an unsupervised pre-training phase can lead to a good performance model and generalizability capability. Accuracy, precision, and recall of the model for Macc features were 97%, 96%, and 99%, respectively. For more consideration about the generalizability of the model, the external data by expression and investigational drugs in drug banks as drug data and randomly selected data from the ZINC database as non-drug were created. The results confirmed the good performance and generalizability capability of the model. Also, the outcomes depicted that a large proportion of misclassified non-drug small molecules ascertain the bioavailability conditions and could be investigated as a drug in the future. Furthermore, our model attempted to tap potential opportunities as a drug filter in drug discovery.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
2秒前
melman发布了新的文献求助10
7秒前
芒果完成签到,获得积分10
8秒前
酷炫的世立应助apt采纳,获得10
9秒前
易安完成签到,获得积分10
10秒前
葳蕤苍生完成签到,获得积分10
10秒前
11秒前
卡皮巴拉完成签到,获得积分10
12秒前
释棱完成签到 ,获得积分10
16秒前
17秒前
草学研究完成签到,获得积分10
22秒前
23秒前
23秒前
27秒前
lx33101128发布了新的文献求助10
28秒前
AaronDP发布了新的文献求助30
30秒前
书白完成签到,获得积分10
31秒前
mihhhhh发布了新的文献求助10
31秒前
xiaofei应助青杉杉采纳,获得10
33秒前
35秒前
xiaoshulin完成签到,获得积分10
37秒前
再一完成签到,获得积分10
39秒前
444发布了新的文献求助10
40秒前
43秒前
43秒前
852应助xxxgoldxsx采纳,获得10
44秒前
飞刀又见飞刀完成签到,获得积分10
45秒前
邱锐杰发布了新的文献求助10
46秒前
mihhhhh完成签到,获得积分10
47秒前
49秒前
51秒前
444完成签到,获得积分10
52秒前
科研通AI2S应助燕麦片采纳,获得10
54秒前
lx33101128发布了新的文献求助10
55秒前
AaronDP完成签到,获得积分10
1分钟前
1分钟前
zhenzheng完成签到 ,获得积分10
1分钟前
科研通AI6.1应助Xiyixuan采纳,获得10
1分钟前
Pony完成签到,获得积分10
1分钟前
蓝橙完成签到,获得积分10
1分钟前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
PowerCascade: A Synthetic Dataset for Cascading Failure Analysis in Power Systems 2000
Various Faces of Animal Metaphor in English and Polish 800
Signals, Systems, and Signal Processing 610
Superabsorbent Polymers: Synthesis, Properties and Applications 500
Photodetectors: From Ultraviolet to Infrared 500
On the Dragon Seas, a sailor's adventures in the far east 500
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6351186
求助须知:如何正确求助?哪些是违规求助? 8165830
关于积分的说明 17184471
捐赠科研通 5407344
什么是DOI,文献DOI怎么找? 2862894
邀请新用户注册赠送积分活动 1840427
关于科研通互助平台的介绍 1689539