计算机科学
Perl公司
管道(软件)
软件
注释
过程(计算)
数据挖掘
文件格式
数据库
软件工程
程序设计语言
人工智能
作者
Wenke Li,Li Fengyu,Siyao Zhang,Bin Cai,Na Zheng,Yu Nie,Zhou Dao,Zhao Qian
出处
期刊:PubMed
日期:2014-06-01
卷期号:36 (6): 618-24
被引量:1
标识
DOI:10.3724/sp.j.1005.2014.0618
摘要
The development of next-generation sequencing has generated high demand for data processing and analysis. Although there are a lot of software for analyzing next-generation sequencing data, most of them are designed for one specific function (e.g., alignment, variant calling or annotation). Therefore, it is necessary to combine them together for data analysis and to generate interpretable results for biologists. This study designed a pipeline to process Illumina sequencing data based on Perl programming language and SGE system. The pipeline takes original sequence data (fastq format) as input, calls the standard data processing software (e.g., BWA, Samtools, GATK, and Annovar), and finally outputs a list of annotated variants that researchers can further analyze. The pipeline simplifies the manual operation and improves the efficiency by automatization and parallel computation. Users can easily run the pipeline by editing the configuration file or clicking the graphical interface. Our work will facilitate the research projects using the sequencing technology.
科研通智能强力驱动
Strongly Powered by AbleSci AI