Distinguishing drug/non-drug-like small molecules in drug discovery using deep belief network

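Drug; Generalizability theory; Drug discovery; PubChem; Computer science; Drug development; Artificial intelligence; Machine learning; Approved drug; Pharmacology; Data mining; Medicine; Computational biology; Bioinformatics; Mathematics; Statistics; Biology
Authors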
Seyed Aghil Hooshmand,Sadegh Azimzadeh Jamalkandi,Seyed Mehdi Alavi,Ali Masoudi‐Nejad
Source
Journal: Molecular Diversity [Springer Science+Business Media]
Volume (issue): 25 (2): 827-838 Citations: 20
Identifier
DOI:10.1007/s11030-020-10065-7
Abstract

Computational methods for efficiently predicting the druglikeness of small molecules, and their growing use in medicinal chemistry and the drug industry, have been a significant scientific development, since only a small fraction of the molecules in small-molecule libraries have been identified as approvable drugs. In this study, a deep belief network was used to construct a druglikeness classification model. Small molecules and approved drugs from the ZINC database were selected for the unsupervised pre-training step and the supervised training step. Several binary fingerprints were investigated as input features: MACCS (166 bit), PubChem (881 bit), and Morgan (2048 bit). The results showed that an unsupervised pre-training phase can yield a model with good performance and generalizability. With MACCS features, the accuracy, precision, and recall of the model were 97%, 96%, and 99%, respectively. To examine generalizability further, an external dataset was created, with expression and investigational drugs from drug banks as the drug data and randomly selected molecules from the ZINC database as the non-drug data. The results on this dataset confirmed the model's good performance and generalizability. The outcomes also showed that a large proportion of the misclassified non-drug small molecules satisfy the bioavailability conditions and could be investigated as drugs in the future. The model therefore has potential as a drug filter in drug discovery.
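
The abstract reports accuracy, precision, and recall for the drug/non-drug classifier. As a reminder of how these three figures relate to the counts of correct and incorrect predictions, here is a minimal sketch in Python. It is not the authors' evaluation code: the function name `binary_metrics`, the `BinaryMetrics` container, and the toy labels are all hypothetical, and the label convention (1 = drug, 0 = non-drug) is an assumption made for illustration.

```python
from dataclasses import dataclass


@dataclass
class BinaryMetrics:
    accuracy: float
    precision: float
    recall: float


def binary_metrics(y_true, y_pred):
    """Compute accuracy, precision, and recall for binary labels.

    Assumed convention (not stated in the abstract): 1 = drug, 0 = non-drug.
    """
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")

    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # drugs predicted as drugs
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # non-drugs predicted as non-drugs
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # non-drugs predicted as drugs
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # drugs predicted as non-drugs

    total = tp + tn + fp + fn
    # Guard against empty denominators by returning 0.0 in those cases.
    accuracy = (tp + tn) / total if total else 0.0
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return BinaryMetrics(accuracy, precision, recall)


# Toy labels for illustration only; these are not data from the paper.
y_true = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 1, 1, 0, 0, 0, 0]
metrics = binary_metrics(y_true, y_pred)
print(metrics)
```

Under this convention, the "misclassified non-drug small molecules" mentioned in the abstract correspond to the false positives (`fp`), which lower precision but leave recall unchanged.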
