基因组
生物
计算生物学
生物群落
肠道微生物群
构造(python库)
序列(生物学)
计算机科学
生物信息学
进化生物学
微生物群
遗传学
生态学
基因
生态系统
程序设计语言
作者
Pengshuo Yang,Wei Zheng,Kang Ning,Yang Zhang
标识
DOI:10.1073/pnas.2110828118
摘要
Significance Metagenome sequencing provides a useful repository to extract evolutionary information and assist protein structure predictions. The sequence-search process, however, becomes increasingly prohibitive due to the huge library size. We hypothesize that there exist inherent evolutionary linkages between microbial niches and protein families that can be used to construct precise multiple sequence alignments (MSAs). To examine the hypothesis, we built a model library of four major biomes containing 4.25 billion sequences. Large-scale protein folding experiments revealed that MSAs collected from individually linked microbiomes can generate more accurate contact and structure models than those from all microbiome sequences but use significantly fewer computing resources. These results demonstrate the potential to solve the metagenome-search problem using a microbiome targeted approach.
科研通智能强力驱动
Strongly Powered by AbleSci AI