基因组
基因组
猎枪
计算生物学
生物
霰弹枪测序
顺序装配
Python(编程语言)
基因
遗传学
计算机科学
操作系统
基因表达
转录组
作者
Sara Saheb Kashaf,Alexandre Almeida,Julia A. Segre,ROBERT FINN
出处
期刊:Nature Protocols
[Springer Nature]
日期:2021-04-16
卷期号:16 (5): 2520-2541
被引量:35
标识
DOI:10.1038/s41596-021-00508-2
摘要
Recovering genomes from shotgun metagenomic sequence data allows detailed taxonomic and functional characterization of individual species or strains in a microbial community. Retrieving these metagenome-assembled genomes (MAGs) involves seven stages. First, low-quality bases, along with adapter and host sequences, are removed. Second, overlapping sequences are assembled to create longer contiguous fragments. Third, these fragments are clustered based on sequence composition and abundance. Fourth, these sequence clusters, or bins, undergo rounds of quality assessment and refinement to yield MAGs. The optional fifth stage is dereplication of MAGs to select representatives. Next, each MAG is taxonomically classified. The optional seventh stage is assessing the fraction of diversity that has been recovered. The output of this protocol is draft genomes, which can provide invaluable clues about uncultured organisms. This protocol takes ~1 week to run, depending on computational resources available, and requires prior experience with high-performance computing, shell script programming and Python. This protocol recovers prokaryotic genomes from short-read metagenomic data. De novo assembly and binning are used to reconstruct metagenome-assembled genomes, which after quality assessment can provide insights into uncultured bacterial species.
科研通智能强力驱动
Strongly Powered by AbleSci AI