药物数据库
虚拟筛选
随机森林
试验装置
机器学习
训练集
计算机科学
人工智能
聚类分析
特征选择
计算生物学
分子描述符
支持向量机
诱饵
生物信息学
药物发现
医学
数量结构-活动关系
化学
药品
生物
药理学
生物化学
受体
作者
Sharath Belenahalli Shekarappa,Shivananda Kandagalla,Julian Lee
摘要
Janus kinase 2 (JAK2) is emerging as a potential therapeutic target for many inflammatory diseases such as myeloproliferative disorders (MPD), cancer and rheumatoid arthritis (RA). In this study, we have collected experimental data of JAK2 protein containing 6021 unique inhibitors. We then characterized them based on Morgan (ECFP6) fingerprints followed by clustering into training and test set based on their molecular scaffolds. These data were used to build the classification models with various supervised machine learning (ML) algorithms that could prioritize novel inhibitors for future drug development against JAK2 protein. The best model built by Random Forest (RF) and Morgan fingerprints achieved the G-mean value of 0.84 on the external test set. As an application of our classification model, virtual screening was performed against Drugbank molecules in order to identify the potential inhibitors based on the confidence score by RF model. Nine potential molecules were identified, which were further subject to molecular docking studies to evaluate the virtual screening results of the best RF model. This proposed method can prove useful for developing novel target-specific JAK2 inhibitors.
科研通智能强力驱动
Strongly Powered by AbleSci AI