三肽
随机森林
人工智能
支持向量机
感知器
抗氧化剂
多层感知器
梯度升压
计算机科学
机器学习
伪氨基酸组成
肽
化学
生物系统
生物化学
二肽
生物
人工神经网络
作者
Nanxiang Yang,Yongyan Pei,Yan Wang,Limin Zhao,Ping Zhao,Xiaoyong Zou
标识
DOI:10.1016/j.chemolab.2023.104845
摘要
Oxidative stress can lead to various diseases, so achieving antioxidation is crucial for combating these diseases. However, the study of antioxidant mechanism is still in its early stage. Most current methods measure the antioxidant properties of polypeptides using experimental approaches, which are typically time-consuming and laborious. It is therefore urgent to develop a theoretical method to identify antioxidant activity based on peptide sequence information. In this study, amino acid composition, dipeptide composition, and “one-hot” vectors were used to characterize the sequence information of tripeptides. Machine learning methods such as support vector machine, random forest, gradient tree boosting and multi-layer perceptron were utilized to construct models for identifying their antioxidant activities. For the five collected datasets, the Q2 of the random forest model reached 0.864, 0.903, 0.867, 0.611, and 0.703, respectively. Compared with the existing methods 0.061, 0.105, 0.028, 0.121 and 0.111 improvements have been obtained, demonstrating the effectiveness of the current approach. The developed method is expected to significantly contribute to the study of polypeptide antioxidant mechanisms.
科研通智能强力驱动
Strongly Powered by AbleSci AI