Unix系统
计算机科学
软件
序列(生物学)
基因组学
序列比对
计算生物学
基因组
程序设计语言
生物
遗传学
基因
肽序列
作者
Maja Tarailo‐Graovac,Nansheng Chen
标识
DOI:10.1002/0471250953.bi0410s25
摘要
RepeatMasker is a popular software tool widely used in computational genomics to identify, classify, and mask repetitive elements, including low-complexity sequences and interspersed repeats. RepeatMasker searches for repetitive sequence by aligning the input genome sequence against a library of known repeats, such as Repbase. Here, we describe two Basic Protocols that provide detailed guidelines on how to use RepeatMasker, either via the Web interface or command-line Unix/Linux system, to analyze repetitive elements in genomic sequences. Sequence comparisons in RepeatMasker are usually performed by the alignment program cross_match, which requires significant processing time for larger sequences. An Alternate Protocol describes how to reduce the processing time using an alternative alignment program, such as WU-BLAST. Further, the advantages, limitations, and known bugs of the software are discussed. Finally, guidelines for understanding the results are provided.
科研通智能强力驱动
Strongly Powered by AbleSci AI