Identification (biology)
Active learning (machine learning)
Computer science
Artificial intelligence
Machine learning
Biology
Botany
Authors
Yongfei Yang, Wenli Gan, Lin Lei, Long Wang, Jianming Wu, Jiesi Luo
Identifier
DOI:10.1021/acs.jcim.4c00718
Abstract
Thrombocytopenia, which is associated with thrombopoietin (TPO) deficiency, presents very limited treatment options and can lead to life-threatening complications. Discovering new therapeutic agents against thrombocytopenia has proven to be a challenging task using traditional screening approaches. Fortunately, machine learning (ML) techniques offer a rapid avenue for exploring chemical space, thereby increasing the likelihood of uncovering new drug candidates. In this study, we focused on computational modeling for drug-induced megakaryocyte differentiation and platelet production using ML methods, aiming to gain insights into the structural characteristics of hematopoietic activity. We developed 112 different classifiers by combining eight ML algorithms with 14 molecule features. The top-performing model achieved good results on both 5-fold cross-validation (with an accuracy of 81.6% and MCC value of 0.589) and external validation (with an accuracy of 83.1% and MCC value of 0.642). Additionally, by leveraging the Shapley additive explanations method, the best model provided quantitative assessments of molecular properties and structures that significantly contributed to the predictions. Furthermore, we employed an ensemble strategy to integrate predictions from multiple models and performed in silico predictions for new molecules with potential activity against thrombocytopenia, sourced from traditional Chinese medicine and the Drug Repurposing Hub. The findings of this study could offer valuable insights into the structural characteristics and computational prediction of thrombopoiesis inducers.
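The two quantitative ideas in the abstract, pairing eight algorithms with 14 feature sets to get 112 classifiers, and scoring each classifier by accuracy and the Matthews correlation coefficient (MCC), can be illustrated with a minimal sketch. The abstract does not name the algorithms or features, so the names below are hypothetical placeholders, and the labels in the usage note are toy values rather than data from the study.

```python
from itertools import product
from math import sqrt

# Hypothetical placeholder names: the abstract gives only the counts (8 and 14).
ALGORITHMS = [f"algorithm_{i}" for i in range(1, 9)]
FEATURE_SETS = [f"feature_set_{j}" for j in range(1, 15)]


def model_grid(algorithms, feature_sets):
    """Return every (algorithm, feature set) pairing."""
    return list(product(algorithms, feature_sets))


def confusion_counts(y_true, y_pred):
    """Count TP, TN, FP, FN for binary labels encoded as 0/1."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    return tp, tn, fp, fn


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    tp, tn, fp, fn = confusion_counts(y_true, y_pred)
    return (tp + tn) / (tp + tn + fp + fn)


def mcc(y_true, y_pred):
    """Matthews correlation coefficient; 0.0 when the denominator is zero."""
    tp, tn, fp, fn = confusion_counts(y_true, y_pred)
    denom = sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denom == 0:
        return 0.0
    return (tp * tn - fp * fn) / denom
```

For example, `len(model_grid(ALGORITHMS, FEATURE_SETS))` gives 112, matching the count reported in the abstract. On the toy labels `y_true = [1, 1, 0, 0]` and `y_pred = [1, 0, 0, 0]`, `accuracy` returns 0.75 while `mcc` returns about 0.577, which shows why the two metrics are reported side by side: MCC accounts for all four confusion-matrix cells and is less flattering on imbalanced or partly wrong predictions.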