子类
随机森林
序列(生物学)
酶
随机矩阵
序列母题
数学
计算机科学
计算生物学
组合数学
生物
生物化学
人工智能
特征向量
物理
遗传学
抗体
量子力学
DNA
作者
Ying Wang,Xiuzhen Hu,Lixia Sun,Zhenxing Feng,Hangyu Song
出处
期刊:Protein and Peptide Letters
[Bentham Science]
日期:2014-01-31
卷期号:21 (3): 275-284
被引量:4
标识
DOI:10.2174/09298665113206660114
摘要
In order to predict enzyme subclasses, this paper builds a new enzyme database in term of previous ideas and methods. Based on protein sequence, by selecting increment of diversity value, low-frequency of power spectral density, matrix scoring values and motif frequency as characteristic parameters to describe the sequence information, a Random Forest algorithm for predicting enzyme subclass is proposed. Using the Jack-knife test, the overall success rate identifying the 18 subclasses of oxidoreductases, the 8 subclasses of transferases, the 5 subclasses of hydrolases, the 6 subclasses of lyases, the 6 subclasses of isomerases, and the 6 subclasses of ligases are 90.86%, 95.24%, 96.42%, 98.60%, 97.53% and 98.03%. Furthermore, the same way is used to the previous database, the better results are obtained.
科研通智能强力驱动
Strongly Powered by AbleSci AI