Computer science
Database search engine
Database
Information retrieval
Computational biology
Search engine
Biology
Authors
Zi Li, Chengxin Zhang, Qidi Zhang, Yang Zhang, Dong-Jun Yu
Identifier
DOI:10.1021/acs.jcim.3c01455
Abstract
The rapidly growing size of the Protein Data Bank is challenging biologists to develop more scalable protein structure alignment tools for fast structure database search. Although many protein structure search algorithms and programs have been designed and implemented for this purpose, most require a large amount of computational time. We propose a novel protein structure search approach, TM-search, which is based on the pairwise structure alignment program TM-align and a new iterative clustering algorithm. Benchmark tests demonstrate that TM-search is 27 times faster than a full database search with TM-align while still identifying ∼90% of all high TM-score hits, which is 2–10 times more hits than are found by other existing programs such as Foldseek, Dali, and PSI-BLAST.
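The abstract does not spell out how the clustering is used during a search, so the following is only a generic sketch of one common way a pre-clustered database can cut the number of pairwise comparisons: score the query against one representative per cluster, and expand only the clusters whose representative scores well. It is not the TM-search algorithm itself. The function `cluster_search`, the parameters `rep_cutoff` and `hit_cutoff`, and the `toy_score` function (a stand-in for a real structure similarity such as the TM-score) are all hypothetical names introduced for illustration.

```python
from typing import Callable, Dict, List, Tuple


def cluster_search(
    query: int,
    clusters: Dict[int, List[int]],
    score: Callable[[int, int], float],
    rep_cutoff: float,
    hit_cutoff: float,
) -> Tuple[List[Tuple[int, float]], int]:
    """Representative-first search over a pre-clustered database.

    `clusters` maps each representative to its members (the representative
    itself is included in the member list). Returns the hits sorted by
    descending score and the number of pairwise comparisons performed.
    """
    hits: List[Tuple[int, float]] = []
    comparisons = 0
    for rep, members in clusters.items():
        comparisons += 1
        rep_score = score(query, rep)
        if rep_score < rep_cutoff:
            continue  # skip the whole cluster: its representative is too dissimilar
        if rep_score >= hit_cutoff:
            hits.append((rep, rep_score))
        for member in members:
            if member == rep:
                continue  # already scored above
            comparisons += 1
            member_score = score(query, member)
            if member_score >= hit_cutoff:
                hits.append((member, member_score))
    hits.sort(key=lambda item: item[1], reverse=True)
    return hits, comparisons


def toy_score(a: int, b: int) -> float:
    """Toy similarity in (0, 1]; NOT a TM-score, just a placeholder."""
    return 1.0 / (1.0 + abs(a - b))


# Toy database: integers stand in for structures, grouped into three clusters.
toy_clusters = {10: [10, 11, 12], 50: [50, 51], 90: [90, 93]}
toy_hits, toy_comparisons = cluster_search(
    query=11, clusters=toy_clusters, score=toy_score,
    rep_cutoff=0.3, hit_cutoff=0.5,
)
print(toy_hits, toy_comparisons)
```

In this toy run only 5 of the 7 database entries are compared with the query, because two clusters are rejected at the representative stage. The trade-off is that a true hit sitting in a cluster with a poorly scoring representative is missed, which is consistent with the abstract reporting recovery of about 90% of hits rather than all of them.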