增强子
计算生物学
生物
调节顺序
染色质
发起人
转录因子
基因组
非编码DNA
组蛋白
DNA测序
DNA结合位点
遗传学
基因
计算机科学
基因表达
作者
Yifeng Li,Chih‐Yu Chen,Alice M. Kaye,Wyeth W. Wasserman
出处
期刊:BioSystems
[Elsevier]
日期:2015-10-24
卷期号:138: 6-17
被引量:54
标识
DOI:10.1016/j.biosystems.2015.10.002
摘要
The majority of the human genome consists of non-coding regions that have been called junk DNA. However, recent studies have unveiled that these regions contain cis-regulatory elements, such as promoters, enhancers, silencers, insulators, etc. These regulatory elements can play crucial roles in controlling gene expressions in specific cell types, conditions, and developmental stages. Disruption to these regions could contribute to phenotype changes. Precisely identifying regulatory elements is key to deciphering the mechanisms underlying transcriptional regulation. Cis-regulatory events are complex processes that involve chromatin accessibility, transcription factor binding, DNA methylation, histone modifications, and the interactions between them. The development of next-generation sequencing techniques has allowed us to capture these genomic features in depth. Applied analysis of genome sequences for clinical genetics has increased the urgency for detecting these regions. However, the complexity of cis-regulatory events and the deluge of sequencing data require accurate and efficient computational approaches, in particular, machine learning techniques. In this review, we describe machine learning approaches for predicting transcription factor binding sites, enhancers, and promoters, primarily driven by next-generation sequencing data. Data sources are provided in order to facilitate testing of novel methods. The purpose of this review is to attract computational experts and data scientists to advance this field.
科研通智能强力驱动
Strongly Powered by AbleSci AI