计算机科学
预测建模
机器学习
人工智能
抗冻蛋白
数据挖掘
化学
生物化学
作者
Nishant Kumar,Shubham Choudhury,Nisha Bajiya,Sumeet Patiyal,Gajendra P. S. Raghava
标识
DOI:10.1002/pmic.202400157
摘要
ABSTRACT Prediction of antifreeze proteins (AFPs) holds significant importance due to their diverse applications in healthcare. An inherent limitation of current AFP prediction methods is their reliance on unreviewed proteins for evaluation. This study evaluates, proposed and existing methods on an independent dataset containing 80 AFPs and 73 non‐AFPs obtained from Uniport, which have been already reviewed by experts. Initially, we constructed machine learning models for AFP prediction using selected composition‐based protein features and achieved a peak AUROC of 0.90 with an MCC of 0.69 on the independent dataset. Subsequently, we observed a notable enhancement in model performance, with the AUROC increasing from 0.90 to 0.93 upon incorporating evolutionary information instead of relying solely on the primary sequence of proteins. Furthermore, we explored hybrid models integrating our machine learning approaches with BLAST‐based similarity and motif‐based methods. However, the performance of these hybrid models either matched or was inferior to that of our best machine‐learning model. Our best model based on evolutionary information outperforms all existing methods on independent/validation dataset. To facilitate users, a user‐friendly web server with a standalone package named “AFPropred” was developed ( https://webs.iiitd.edu.in/raghava/afpropred ).
科研通智能强力驱动
Strongly Powered by AbleSci AI