超家族
计算机科学
多样性(控制论)
互联网
计算生物学
集合(抽象数据类型)
蛋白质工程
多序列比对
数据科学
情报检索
生物信息学
序列比对
数据挖掘
万维网
生物
人工智能
肽序列
酶
遗传学
生物化学
程序设计语言
受体
基因
作者
Remko Kuipers,Henk-Jan Joosten,Willem J. H. van Berkel,Nicole G. H. Leferink,Erik Rooijen,Erik Ittmann,Frank van Zimmeren,Helge Jochens,Uwe T. Bornscheuer,Gert Vriend,Vítor A. P. Martins dos Santos,Peter J. Schaap
出处
期刊:Proteins
[Wiley]
日期:2010-01-01
卷期号:: NA-NA
被引量:147
摘要
Ten years of experience with molecular class-specific information systems (MCSIS) such as with the hand-curated G protein-coupled receptor database (GPCRDB) or the semiautomatically generated nuclear receptor database has made clear that a wide variety of questions can be answered when protein-related data from many different origins can be flexibly combined. MCSISes revolve around a multiple sequence alignment (MSA) that includes "all" available sequences from the entire superfamily, and it has been shown at many occasions that the quality of these alignments is the most crucial aspect of the MCSIS approach. We describe here a system called 3DM that can automatically build an entire MCSIS. 3DM bases the MSA on a multiple structure alignment, which implies that the availability of a large number of superfamily members with a known three-dimensional structure is a requirement for 3DM to succeed well. Thirteen MCSISes were constructed and placed on the Internet for examination. These systems have been instrumental in a large series of research projects related to enzyme activity or the understanding and engineering of specificity, protein stability engineering, DNA-diagnostics, drug design, and so forth.
科研通智能强力驱动
Strongly Powered by AbleSci AI