开放式参考框架
生物
遗传学
基因组
同源(生物学)
枯草杆菌素
保守序列
序列比对
同源建模
多序列比对
古细菌
蛋白质家族
计算生物学
肽序列
基因
打开阅读框
生物化学
酶
作者
Roland J. Siezen,Bernadet Renckens,Jos Boekhorst
出处
期刊:Proteins
[Wiley]
日期:2007-03-08
卷期号:67 (3): 681-694
被引量:56
摘要
Abstract Subtilisin‐like serine proteases (subtilases) are a very diverse family of serine proteases with low sequence homology, often limited to regions surrounding the three catalytic residues. Starting with different Hidden Markov Models (HMM), based on sequence alignments around the catalytic residues of the S8 family (subtilisins) and S53 family (sedolisins), we iteratively searched all ORFs in the complete genomes of 313 eubacteria and archaea. In 164 genomes we identified a total of 567 ORFs with one or more of the conserved regions with a catalytic residue. The large majority of these contained all three regions around the “classical” catalytic residues of the S8 family (Asp‐His‐Ser), while 63 proteins were identified as S53 (sedolisin) family members (Glu‐Asp‐Ser). More than 30 proteins were found to belong to two novel subsets with other evolutionary variations in catalytic residues, and new HMMs were generated to search for them. In one subset the catalytic Asp is replaced by an equivalent Glu (i.e. Glu‐His‐Ser family). The other subset resembles sedolisins, but the conserved catalytic Asp is not located on the same helix as the nucleophile Glu, but rather on a β‐sheet strand in a topologically similar position, as suggested by homology modeling. The Prokaryotic Subtilase Database ( www.cmbi.ru.nl/subtilases ) provides access to all information on the identified subtilases, the conserved sequence regions, the proposed family subdivision, and the appropriate HMMs to search for them. Over 100 proteins were predicted to be subtilases for the first time by our improved searching methods, thereby improving genome annotation. Proteins 2007. © 2007 Wiley‐Liss, Inc.
科研通智能强力驱动
Strongly Powered by AbleSci AI