结构变异
人类基因组
基因组
计算生物学
基因组学
计算机科学
序列(生物学)
错误发现率
DNA测序
生物
顺序装配
遗传学
基因
基因表达
转录组
作者
Luca Denti,Parsoa Khorsand,Paola Bonizzoni,Fereydoun Hormozdiari,Rayan Chikhi
出处
期刊:Nature Methods
[Springer Nature]
日期:2022-12-22
卷期号:20 (4): 550-558
被引量:12
标识
DOI:10.1038/s41592-022-01674-1
摘要
Structural variants (SVs) account for a large amount of sequence variability across genomes and play an important role in human genomics and precision medicine. Despite intense efforts over the years, the discovery of SVs in individuals remains challenging due to the diploid and highly repetitive structure of the human genome, and by the presence of SVs that vastly exceed sequencing read lengths. However, the recent introduction of low-error long-read sequencing technologies such as PacBio HiFi may finally enable these barriers to be overcome. Here we present SV discovery with sample-specific strings (SVDSS)—a method for discovery of SVs from long-read sequencing technologies (for example, PacBio HiFi) that combines and effectively leverages mapping-free, mapping-based and assembly-based methodologies for overall superior SV discovery performance. Our experiments on several human samples show that SVDSS outperforms state-of-the-art mapping-based methods for discovery of insertion and deletion SVs in PacBio HiFi reads and achieves notable improvements in calling SVs in repetitive regions of the genome. SVDSS combines multiple strategies to improve detection of structural variation using low-error long-read sequencing data.
科研通智能强力驱动
Strongly Powered by AbleSci AI