支持向量机
伪氨基酸组成
模式识别(心理学)
人工智能
计算机科学
蛋白质测序
相似性(几何)
刀切重采样
水准点(测量)
特征向量
结构相似性
蛋白质法
分类器(UML)
数据挖掘
二肽
机器学习
数学
肽序列
生物
氨基酸
统计
生物化学
图像(数学)
基因
估计员
地理
大地测量学
作者
Taigang Liu,Xiaoqi Zheng,Jun Wang
出处
期刊:Biochimie
[Elsevier]
日期:2010-10-01
卷期号:92 (10): 1330-1334
被引量:119
标识
DOI:10.1016/j.biochi.2010.06.013
摘要
Knowledge of structural class plays an important role in understanding protein folding patterns. In this study, a simple and powerful computational method, which combines support vector machine with PSI-BLAST profile, is proposed to predict protein structural class for low-similarity sequences. The evolution information encoding in the PSI-BLAST profiles is converted into a series of fixed-length feature vectors by extracting amino acid composition and dipeptide composition from the profiles. The resulting vectors are then fed to a support vector machine classifier for the prediction of protein structural class. To evaluate the performance of the proposed method, jackknife cross-validation tests are performed on two widely used benchmark datasets, 1189 (containing 1092 proteins) and 25PDB (containing 1673 proteins) with sequence similarity lower than 40% and 25%, respectively. The overall accuracies attain 70.7% and 72.9% for 1189 and 25PDB datasets, respectively. Comparison of our results with other methods shows that our method is very promising to predict protein structural class particularly for low-similarity datasets and may at least play an important complementary role to existing methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI