作者
Suguru Nishijima,Evelina Stankevič,Oliver Aasmets,Thomas Schmidt,Naoyoshi Nagata,M. Keller,Pamela Ferretti,Helene Bæk Juel,Anthony Fullam,Shahriyar Mahdi Robbani,Christian Schudoma,Johanne Kragh Hansen,Louise Aas Holm,Mads Israelsen,Robert Schierwagen,Nikolaj Torp,Manimozhiyan Arumugam,Flemming Bendtsen,Charlotte Brøns,Cilius Esmann Fonvig,Jens‐Christian Holm,Trine Nielsen,Julie Steen Pedersen,Maja Thiele,Jonel Trebicka,Elin Org,Aleksander Krag,Torben Hansen,Michael Kuhn,Peer Bork
摘要
Abstract The microbiota in individual habitats differ both in relative composition and absolute abundance. While sequencing approaches determine only the relative abundances of taxa and genes, experimental techniques for absolute abundance determination are rarely applied to large-scale microbiome studies. Here, we developed a machine learning approach to predict fecal microbial loads (microbial cells per gram) solely from relative abundance data. Applied to large-scale datasets (n = 34,539), we demonstrate that microbial load is the major determinant of gut microbiome variation and associated with numerous host factors. We found that for several diseases, the altered microbial load, not the disease itself, was the main driver of the gut microbiome changes. Adjusting for this effect substantially reduced the significance of more than half of the disease-associated species. Our analysis reveals that the fecal microbial load is a major confounder in microbiome studies, highlighting its importance for understanding microbiome variation in health and disease.