可解释性
计算机科学
稳健性(进化)
机器学习
药物发现
人工智能
过程(计算)
代表(政治)
构造(python库)
结核分枝杆菌
数据挖掘
化学
肺结核
医学
生物化学
病理
政治
政治学
法学
基因
程序设计语言
操作系统
作者
Lantian Yao,Jiahui Guan,Wenshuo Li,Chia‐Ru Chung,Junyang Deng,Ying‐Chih Chiang,Tzong-Yi Lee
标识
DOI:10.1021/acs.analchem.3c04196
摘要
Tuberculosis (TB) is a severe disease caused by Mycobacterium tuberculosis that poses a significant threat to human health. The emergence of drug-resistant strains has made the global fight against TB even more challenging. Antituberculosis peptides (ATPs) have shown promising results as a potential treatment for TB. However, conventional wet lab-based approaches to ATP discovery are time-consuming and costly and often fail to discover peptides with desired properties. To address these challenges, we propose a novel machine learning-based framework called ATPfinder that can significantly accelerate the discovery of ATP. Our approach integrates various efficient peptide descriptors and utilizes the deep forest algorithm to construct the model. This neural network-like cascading structure can effectively process and mine features without complex hyperparameter tuning. Our experimental results show that ATPfinder outperforms existing ATP prediction tools, achieving state-of-the-art performance with an accuracy of 89.3% and an MCC of 0.70. Moreover, our framework exhibits better robustness than baseline algorithms commonly used for other sequence analysis tasks. Additionally, the excellent interpretability of our model can assist researchers in understanding the critical features of ATP. Finally, we developed a downloadable desktop application to simplify the use of our framework for researchers. Therefore, ATPfinder can facilitate the discovery of peptide drugs and provide potential solutions for TB treatment. Our framework is freely available at https://github.com/lantianyao/ATPfinder/ (data sets and code) and https://awi.cuhk.edu.cn/dbAMP/ATPfinder.html (software).
科研通智能强力驱动
Strongly Powered by AbleSci AI