搜索引擎索引
计算机科学
基因组
索引(排版)
计算生物学
参考基因组
自由序列分析
序列比对
数据挖掘
生物
遗传学
人工智能
基因
万维网
肽序列
作者
Daehwan Kim,Ben Langmead,Steven L. Salzberg
出处
期刊:Nature Methods
[Springer Nature]
日期:2015-03-09
卷期号:12 (4): 357-360
被引量:18371
摘要
HISAT (hierarchical indexing for spliced alignment of transcripts) uses global and local indices for fast, sensitive alignment with small memory requirements. HISAT (hierarchical indexing for spliced alignment of transcripts) is a highly efficient system for aligning reads from RNA sequencing experiments. HISAT uses an indexing scheme based on the Burrows-Wheeler transform and the Ferragina-Manzini (FM) index, employing two types of indexes for alignment: a whole-genome FM index to anchor each alignment and numerous local FM indexes for very rapid extensions of these alignments. HISAT's hierarchical index for the human genome contains 48,000 local FM indexes, each representing a genomic region of ∼64,000 bp. Tests on real and simulated data sets showed that HISAT is the fastest system currently available, with equal or better accuracy than any other method. Despite its large number of indexes, HISAT requires only 4.3 gigabytes of memory. HISAT supports genomes of any size, including those larger than 4 billion bases.
科研通智能强力驱动
Strongly Powered by AbleSci AI