序列(生物学)
计算机科学
蛋白质测序
蛋白质功能预测
代表(政治)
人工智能
模式识别(心理学)
集合(抽象数据类型)
蛋白质结构预测
编码(社会科学)
数据挖掘
蛋白质功能
蛋白质结构
肽序列
数学
生物
遗传学
基因
统计
生物化学
政治
政治学
法学
程序设计语言
作者
Lei Yang,Junfeng Xia,Jie Gui
出处
期刊:Protein and Peptide Letters
[Bentham Science]
日期:2010-09-01
卷期号:17 (9): 1085-1090
被引量:161
标识
DOI:10.2174/092986610791760306
摘要
With a huge amount of protein sequence data, the computational method for protein – protein interaction (PPI) prediction using only the protein sequences information have drawn increasing interest. In this article, we propose a sequence- based method based on a novel representation of local protein sequence descriptors. Local descriptors account for the interactions between residues in both continuous and discontinuous regions of a protein sequence, so this method enables us to extract more PPI information from the sequence. A series of elaborate experiments are performed to optimize the prediction model by varying the parameter k and the distance measuring function of the k-nearest neighbors learning system and the ways of coding a protein pair. When performed on the PPI data of Saccharomyces cerevisiae, the method achieved 86.15% prediction accuracy with 81.03% sensitivity at the precision of 90.24%. An independent data set of 986 Escherichia coli PPIs was used to evaluate this prediction model and the prediction accuracy is 73.02%. Given the complex nature of PPIs, the performance of our method is promising, and it can be a helpful supplement for PPIs prediction. Keywords: Feature representation, KNNs, local descriptors, PPIs prediction, protein sequence, sequence-based method
科研通智能强力驱动
Strongly Powered by AbleSci AI