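Genome
Computer science
Biology
Data mining
Reference genome
Software
Search engine indexing
Computational biology
Information retrieval
Genetics
Gene
Programming language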
Authors
Jens-Uwe Ulrich, Bernhard Y. Renard
Source
Journal: Genome Research
[Cold Spring Harbor Laboratory]
Date: 2024-06-01
Volume (issue), pages: 34 (6): 914-924
Identifier
DOI:10.1101/gr.278623.123
Abstract
Metagenomic long-read sequencing is gaining popularity for various applications, including pathogen detection and microbiome studies. To analyze the large data created in those studies, software tools need to taxonomically classify the sequenced molecules and estimate the relative abundances of organisms in the sequenced sample. Because of the exponential growth of reference genome databases, the current taxonomic classification methods have large computational requirements. This issue motivated us to develop a new data structure for fast and memory-efficient querying of long reads. Here, we present Taxor as a new tool for long-read metagenomic classification using a hierarchical interleaved XOR filter data structure for indexing and querying large reference genome sets. Taxor implements several