16S核糖体RNA
核糖体RNA
计算生物学
生物
基因组
霰弹枪测序
放大器
假阳性悖论
基因
遗传学
基因组
聚合酶链反应
计算机科学
人工智能
作者
Kristen Curry,Qi Wang,Michael Nute,Alona Tyshaieva,Elizabeth Reeves,Sirena Soriano,Qinglong Wu,Enid Graeber,Patrick Finzer,Werner Mendling,Tor Savidge,Sonia Villapol,Alexander Dilthey,Todd J. Treangen
出处
期刊:Nature Methods
[Springer Nature]
日期:2022-06-30
卷期号:19 (7): 845-853
被引量:121
标识
DOI:10.1038/s41592-022-01520-4
摘要
16S ribosomal RNA-based analysis is the established standard for elucidating the composition of microbial communities. While short-read 16S rRNA analyses are largely confined to genus-level resolution at best, given that only a portion of the gene is sequenced, full-length 16S rRNA gene amplicon sequences have the potential to provide species-level accuracy. However, existing taxonomic identification algorithms are not optimized for the increased read length and error rate often observed in long-read data. Here we present Emu, an approach that uses an expectation–maximization algorithm to generate taxonomic abundance profiles from full-length 16S rRNA reads. Results produced from simulated datasets and mock communities show that Emu is capable of accurate microbial community profiling while obtaining fewer false positives and false negatives than alternative methods. Additionally, we illustrate a real-world application of Emu by comparing clinical sample composition estimates generated by an established whole-genome shotgun sequencing workflow with those returned by full-length 16S rRNA gene sequences processed with Emu. Emu accurately estimates microbial abundance using full-length Nanopore 16S rRNA gene sequencing data.
科研通智能强力驱动
Strongly Powered by AbleSci AI