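Biology
GenBank
Protist
Phylogenetic tree
Phylogenetics
Phylum
Context (archaeology)
Ribosomal RNA
Metadata
Evolutionary biology
Computational biology
Gene
Genetics
Computer science
Paleontology
Operating system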
Authors
Vittorio Boscaro, Luciana F. Santoferrara, Qianqian Zhang, Eleni Gentekaki, Mitchell J. Syberg-Olsen, Javier del Campo, Patrick J. Keeling
Identifier
DOI:10.1111/1462-2920.14264
Abstract
High-throughput sequencing (HTS) surveys, among the most common approaches currently used in environmental microbiology, require reliable reference databases to be correctly interpreted. The EukRef Initiative (eukref.org) is a community effort to manually screen available small subunit (SSU) rRNA gene sequences and produce a public, high-quality and informative framework of phylogeny-based taxonomic annotations. In the context of EukRef, we present a database for the monophyletic phylum Ciliophora, one of the most complex, diverse and ubiquitous protist groups. We retrieved more than 11 500 sequences of ciliates present in GenBank (28% from identified isolates and 72% from environmental surveys). Our approach included the inference of phylogenetic trees for every ciliate lineage and produced the largest SSU rRNA tree of the phylum Ciliophora to date. We flagged approximately 750 chimeric or low-quality sequences, improved the classification of 70% of GenBank entries and enriched environmental and literature metadata by 30%. The performance of EukRef-Ciliophora is superior to the current SILVA database in classifying HTS reads from a global marine survey. Comprehensive outputs are publicly available to make the new tool a useful guide for non-specialists and a quick reference for experts.