基因组
纳米孔测序
拉伤
单倍型
生物
基因组
移相器
计算生物学
纳米孔
遗传学
基因
基因型
纳米技术
材料科学
物理
光学
解剖
作者
E. A. Kazantseva,Ataberk Donmez,M. S. Frolova,Mihai Pop,Mikhail Kolmogorov
标识
DOI:10.1101/2023.01.31.526521
摘要
Abstract Bacterial species in microbial communities are often represented by mixtures of strains, distinguished by small variations in their genomes. Despite the functional importance of intra-strain heterogeneity, its characterization from metagenomic sequencing data remains challenging. Short-read approaches can be used to detect small-scale variation between strains, but fail to phase these variants into contiguous haplotypes. Long-read metagenome assemblers can generate contiguous bacterial chromosomes, but often suppress strain-level variation in favor of species-level consensus. Here we present Strainy - an algorithm for strain-level metagenome assembly and phasing from Nanopore and HiFi reads. Strainy takes a de novo metagenomic assembly as input, identifies strain variants which are then phased and assembled into contiguous haplotypes. Using simulated and mock Nanopore and HiFi metagenome data, we show that Strainy assembles accurate and complete strain haplotypes, outperforming current Nanopore-based methods and comparable with HiFi-based algorithms in completeness and accuracy. We then use Strainy to assemble strain haplotypes of a complex environmental metagenome, revealing distinct mutational patterns in bacterial species.
科研通智能强力驱动
Strongly Powered by AbleSci AI