染色质
生物
增强子
计算生物学
序列(生物学)
特征(语言学)
卷积神经网络
基因组
DNA测序
遗传学
DNA
基因
计算机科学
转录因子
人工智能
语言学
哲学
作者
Qiao-Ying Ji,Xiujun Gong,Haoming Li,Pu-Feng Du
出处
期刊:Genomics
[Elsevier BV]
日期:2021-10-19
卷期号:113 (6): 4052-4060
被引量:11
标识
DOI:10.1016/j.ygeno.2021.10.007
摘要
Super-enhancer (SE) is a cluster of active typical enhancers (TE) with high levels of the Mediator complex, master transcriptional factors, and chromatin regulators. SEs play a key role in the control of cell identity and disease. Traditionally, scientists used a variety of high-throughput data of different transcriptional factors or chromatin marks to distinguish SEs from TEs. This kind of experimental methods are usually costly and time-consuming. In this paper, we proposed a model DeepSE, which is based on a deep convolutional neural network model, to distinguish the SEs from TEs. DeepSE represent the DNA sequences using the dna2vec feature embeddings. With only the DNA sequence information, DeepSE outperformed all state-of-the-art methods. In addition, DeepSE can be generalized well across different cell lines, which implied that cell-type specific SEs may share hidden sequence patterns across different cell lines. The source code and data are stored in GitHub (https://github.com/QiaoyingJi/DeepSE).
科研通智能强力驱动
Strongly Powered by AbleSci AI