计算机科学
鉴定(生物学)
结构变异
计算生物学
领域(数学)
滤波器(信号处理)
数据挖掘
基因组
生物
遗传学
基因
计算机视觉
数学
植物
纯数学
作者
Fritz J. Sedlazeck,Philipp Rescheneder,Moritz Smolka,Han Fang,Maria Nattestad,Arndt von Haeseler,Michael C. Schatz
出处
期刊:Nature Methods
[Springer Nature]
日期:2018-04-27
卷期号:15 (6): 461-468
被引量:1320
标识
DOI:10.1038/s41592-018-0001-7
摘要
Structural variations are the greatest source of genetic variation, but they remain poorly understood because of technological limitations. Single-molecule long-read sequencing has the potential to dramatically advance the field, although high error rates are a challenge with existing methods. Addressing this need, we introduce open-source methods for long-read alignment (NGMLR; https://github.com/philres/ngmlr ) and structural variant identification (Sniffles; https://github.com/fritzsedlazeck/Sniffles ) that provide unprecedented sensitivity and precision for variant detection, even in repeat-rich regions and for complex nested events that can have substantial effects on human health. In several long-read datasets, including healthy and cancerous human genomes, we discovered thousands of novel variants and categorized systematic errors in short-read approaches. NGMLR and Sniffles can automatically filter false events and operate on low-coverage data, thereby reducing the high costs that have hindered the application of long reads in clinical and research settings. NGMLR and Sniffles perform highly accurate alignment and structural variation detection from long-read sequencing data.
科研通智能强力驱动
Strongly Powered by AbleSci AI