Classification models and SAR analysis on HDAC1 inhibitors using machine learning methods

随机森林 人工智能 支持向量机 异羟肟酸 计算机科学 HDAC1型 试验装置 机器学习 人工神经网络 组蛋白脱乙酰基酶 模式识别(心理学) 计算生物学 化学 生物 组蛋白 生物化学 立体化学 基因
作者
Rourou Li,Yujia Tian,Zhenwu Yang,Yueshan Ji,Jiaqi Ding,Aixia Yan
出处
期刊:Molecular Diversity [Springer Science+Business Media]
卷期号:27 (3): 1037-1051 被引量:6
标识
DOI:10.1007/s11030-022-10466-w
摘要

Histone deacetylase (HDAC) 1, a member of the histone deacetylases family, plays a pivotal role in various tumors. In this study, we collected 7313 human HDAC1 inhibitors with bioactivities to form a dataset. Then, the dataset was divided into a training set and a test set using two splitting methods: (1) Kohonen’s self-organizing map and (2) random splitting. The molecular structures were represented by MACCS fingerprints, RDKit fingerprints, topological torsions fingerprints and ECFP4 fingerprints. A total of 80 classification models were built by using five machine learning methods, including decision tree (DT), random forest, support vector machine, eXtreme Gradient Boosting and deep neural network. Model 15A_2 built by the XGBoost algorithm based on ECFP4 fingerprints showed the best performance, with an accuracy of 88.08% and an MCC value of 0.76 on the test set. Finally, we clustered the 7313 HDAC1 inhibitors into 31 subsets, and the substructural features in each subset were investigated. Moreover, using DT algorithm we analyzed the structure–activity relationship of HDAC1 inhibitors. It may conclude that some substructures have a significant effect on high activity, such as N-(2-amino-phenyl)-benzamide, benzimidazole, AR-42 analogues, hydroxamic acid with a middle chain alkyl and 4-aryl imidazole with a midchain of alkyl whose α carbon is chiral.Graphical abstract

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
史迪仔完成签到,获得积分10
1秒前
1秒前
雁菡清清完成签到 ,获得积分10
1秒前
12345678发布了新的文献求助10
1秒前
王小小翔完成签到,获得积分10
1秒前
1秒前
充电宝应助Sissi采纳,获得30
1秒前
yyl完成签到,获得积分10
1秒前
研友_VZG7GZ应助那个谁谁采纳,获得10
1秒前
2秒前
2秒前
李健的小迷弟应助小唐采纳,获得10
2秒前
2秒前
2秒前
商羽发布了新的文献求助10
2秒前
3秒前
PEI发布了新的文献求助10
3秒前
12332145678发布了新的文献求助10
3秒前
冲鸭完成签到,获得积分10
3秒前
CEJ发布了新的文献求助10
4秒前
七分之二十四完成签到,获得积分10
4秒前
4秒前
小宇宙发布了新的文献求助10
5秒前
1213完成签到,获得积分10
5秒前
5秒前
Aiden发布了新的文献求助10
8秒前
8秒前
12332145678完成签到,获得积分10
8秒前
8秒前
隐形曼青应助薛定谔的猫采纳,获得10
8秒前
9秒前
领导范儿应助谷雨采纳,获得10
9秒前
科科完成签到,获得积分10
11秒前
briskguo发布了新的文献求助10
11秒前
金也完成签到,获得积分10
11秒前
12秒前
12秒前
wanci应助吴小利采纳,获得20
12秒前
传奇3应助包包采纳,获得10
13秒前
yzj发布了新的文献求助10
14秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Lewis’s Child and Adolescent Psychiatry: A Comprehensive Textbook Sixth Edition 2000
Cronologia da história de Macau 1600
Treatment response-adapted risk index model for survival prediction and adjuvant chemotherapy selection in nonmetastatic nasopharyngeal carcinoma 1000
Lloyd's Register of Shipping's Approach to the Control of Incidents of Brittle Fracture in Ship Structures 1000
BRITTLE FRACTURE IN WELDED SHIPS 1000
Toughness acceptance criteria for rack materials and weldments in jack-ups 800
热门求助领域 (近24小时)
化学 材料科学 医学 生物 工程类 有机化学 纳米技术 计算机科学 化学工程 生物化学 物理 复合材料 内科学 催化作用 物理化学 光电子学 细胞生物学 基因 电极 遗传学
热门帖子
关注 科研通微信公众号,转发送积分 6207516
求助须知:如何正确求助?哪些是违规求助? 8033933
关于积分的说明 16735180
捐赠科研通 5298291
什么是DOI,文献DOI怎么找? 2823034
邀请新用户注册赠送积分活动 1801949
关于科研通互助平台的介绍 1663415