注释
管道(软件)
计算机科学
生物
德布鲁因图
计算生物学
工作流程
串联重复
DNA测序
遗传学
软件
基因组
基因
DNA
鉴定(生物学)
图形
数据库
人工智能
程序设计语言
理论计算机科学
植物
作者
Petr Novák,Pavel Neumann,Jir̆ı́ Macas
出处
期刊:Nature Protocols
[Springer Nature]
日期:2020-10-23
卷期号:15 (11): 3745-3776
被引量:196
标识
DOI:10.1038/s41596-020-0400-y
摘要
RepeatExplorer2 is a novel version of a computational pipeline that uses graph-based clustering of next-generation sequencing reads for characterization of repetitive DNA in eukaryotes. The clustering algorithm facilitates repeat identification in any genome by using relatively small quantities of short sequence reads, and additional tools within the pipeline perform automatic annotation and quantification of the identified repeats. The pipeline is integrated into the Galaxy platform, which provides a user-friendly web interface for script execution and documentation of the results. Compared to the original version of the pipeline, RepeatExplorer2 provides automated annotation of transposable elements, identification of tandem repeats and enhanced visualization of analysis results. Here, we present an overview of the RepeatExplorer2 workflow and provide procedures for its application to (i) de novo repeat identification in a single species, (ii) comparative repeat analysis in a set of species, (iii) development of satellite DNA probes for cytogenetic experiments and (iv) identification of centromeric repeats based on ChIP-seq data. Each procedure takes approximately 2 d to complete. RepeatExplorer2 is available at https://repeatexplorer-elixir.cerit-sc.cz .
科研通智能强力驱动
Strongly Powered by AbleSci AI