阿达布思
人工智能
计算机科学
机器学习
判别式
支持向量机
分类器(UML)
模式识别(心理学)
集成学习
特征(语言学)
语言学
哲学
作者
Shahid Akbar,Farman Ali,Maqsood Hayat,Ashfaq Ahmad,Salman Khan,Sarah Gul
标识
DOI:10.1016/j.chemolab.2022.104682
摘要
Viral diseases are a major health concern in the last few years. Antiviral peptides (AVPs) belong to a type of antimicrobial peptides (AMPs) that have the high potential to defend the human body from various viral diseases. Despite the large production of antiviral vaccination and drugs, viral infections are still a prominent human disease. The discovery of AVPs as an antiviral agent offers an effective way to treat virus-affected cells. Recently, the development of peptide-based therapeutic agents via machine learning methods is becoming a major area of interest due to its promising results. In this paper, we developed an intelligent and computationally efficient learning approach for the reliable identification of AVPs. The novel evolutionary descriptors are explored via embedding discrete wavelet transform and k-segmentation approaches into the position-specific scoring matrix. Moreover, the Shapley Additive exPlanations (SHAP) based global interpretation analysis is employed to choose optimal features by measuring the contributions of each feature in the extracted vectors. In the next phase, the selected feature spaces are examined using five different classifiers, such as XGBoost (XGB), k-nearest neighbor (KNN), Extra Trees classifier (ETC), Support Vector Machine (SVM), and Adaboost (ADA). Furthermore, to boost the discriminative power of the proposed model, the predicted labels of all classifiers are given to the optimized genetic algorithm to build an ensemble learner. Hence, our proposed study reported a higher classification rate of 97.33% and 95.57% via training samples and independent samples, respectively. Which is ∼5% improved accuracy than available predictors. It is recommended that our model will be a helpful approach for the researchers and may perform a significant role in research academia and drug development. The source code and all datasets are publicly available at https://github.com/wangphd0/pAVP_PSSMDWT-EnC.
科研通智能强力驱动
Strongly Powered by AbleSci AI