计算生物学
铁载体
基因组
生物
功能(生物学)
序列(生物学)
次生代谢物
管道(软件)
全基因组测序
次生代谢
系统发育树
系统发育学
细菌
进化生物学
生物合成
基因
遗传学
计算机科学
程序设计语言
作者
Shaohua Gu,Yuanzhe Shao,Karoline Rehm,Laurent Bigler,Di Zhang,Ruolin He,Ruichen Xu,Jiqi Shao,Alexandre Jousset,Ville‐Petri Friman,Xiaoying Bian,Zhong Wei,Rolf Kümmerli,Zhiyuan Li
标识
DOI:10.7554/elife.96719.2
摘要
Microbial secondary metabolites are a rich source for pharmaceutical discoveries and play crucial ecological functions. While tools exist to identify secondary metabolite clusters in genomes, precise sequence-to-function mapping remains challenging because neither function nor substrate specificity of biosynthesis enzymes can accurately be predicted. Here we developed a knowledge-guided bioinformatic pipeline to solve these issues. We analyzed 1928 genomes of Pseudomonas bacteria and focused on iron-scavenging pyoverdines as model metabolites. Our pipeline predicted 188 chemically different pyoverdines with nearly 100% structural accuracy and the presence of 94 distinct receptor groups required for the uptake of iron-loaded pyoverdines. Our pipeline unveils an enormous yet overlooked diversity of siderophores (151 new structures) and receptors (91 new groups). Our approach, combining feature sequence with phylogenetic approaches, is extendable to other metabolites and microbial genera, and thus emerges as powerful tool to reconstruct bacterial secondary metabolism pathways based on sequence data.
科研通智能强力驱动
Strongly Powered by AbleSci AI