数量结构-活动关系
机器学习
鉴定(生物学)
生物信息学
人工智能
计算机科学
过程(计算)
聚类分析
预测建模
偏最小二乘回归
药物开发
主成分分析
数据挖掘
生化工程
化学
工程类
药品
生物
操作系统
基因
药理学
植物
生物化学
作者
James H. Zothantluanga,Dipak Chetia,Sanchaita Rajkhowa,Abd. Kakhar Umar
标识
DOI:10.1080/1062936x.2023.2169347
摘要
Identification of lead compounds with the traditional laboratory approach is expensive and time-consuming. Nowadays, in silico techniques have emerged as a promising approach for lead identification. In this study, we aim to develop robust and predictive 2D-QSAR models to identify lead flavonoids by predicting the IC50 against Plasmodium falciparum. We applied machine learning algorithms (Principal component analysis followed by K-means clustering) and Pearson correlation analysis to select 9 molecular descriptors (MDs) for model building. We selected and validated the three best QSAR models after execution of multiple linear regression (MLR) 100 times with different combinations of MDs. The developed models have fulfilled the five principles for QSAR models as specified by the Organization for Economic Co-operation and Development. The outcome of the study is a reliable and sustainable in silico method of IC50 (Mean ± SD) prediction that will positively impact the antimalarial drug development process by reducing the money and time required to identify potential antimalarial lead compounds from the class of flavonoids. We also developed a web tool (JazQSAR, https://etflin.com/news/4) to offer an easily accessible platform for the developed QSAR models.
科研通智能强力驱动
Strongly Powered by AbleSci AI