序列(生物学)
计算机科学
折叠(DSP实现)
结构线形
回路建模
蛋白质结构
算法
肽序列
序列比对
生物系统
理论计算机科学
计算生物学
化学
蛋白质结构预测
生物
生物化学
工程类
基因
电气工程
作者
Armando D. Solis,S. Rackovsky
出处
期刊:Proteins
[Wiley]
日期:2000-02-01
卷期号:38 (2): 149-164
被引量:68
标识
DOI:10.1002/(sici)1097-0134(20000201)38:2<149::aid-prot4>3.0.co;2-#
摘要
In an effort to quantify loss of information in the processing of protein bioinformatic data, we examine how representations of amino acid sequence and backbone conformation affect the quantity of accessible structural information from local sequence. We propose a method to extract the maximum amount of peptide backbone structural information available in local sequence fragments, given a finite structural data set. Using methods of information theory, we develop an unbiased measure of local structural information that gauges changes in structural distributions when different representations of secondary structure and local sequence are used. We find that the manner in which backbone structure is represented affects the amount and quality of structural information that may be extracted from local sequence. Representations based on virtual bonds capture more structural information from local sequence than a three-state assignment scheme (helix/strand/loop). Furthermore, we find that amino acids show significant kinship with respect to the backbone structural information they carry, so that a collapse of the amino acid alphabet can be accomplished without severely affecting the amount of extractable information. This strategy is critical in optimizing the utility of a limited database of experimentally solved protein structures. Finally, we discuss the similarities within and differences between groups of amino acids in their roles in the local folding code and recognize specific amino acids critical in the formation of local structure.
科研通智能强力驱动
Strongly Powered by AbleSci AI