增强子
计算生物学
卷积神经网络
计算机科学
人工智能
鉴定(生物学)
拟南芥
人工神经网络
深度学习
功能(生物学)
集合(抽象数据类型)
机器学习
生物
模式识别(心理学)
基因
遗传学
转录因子
植物
突变体
程序设计语言
作者
Yiqiong Chen,Yujia Gao,Hejie Zhou,Yanming Zuo,Youhua Zhang,Zhenyu Yue
出处
期刊:Current Bioinformatics
[Bentham Science]
日期:2021-11-24
卷期号:17 (6): 531-540
被引量:3
标识
DOI:10.2174/1574893616666211123094301
摘要
Background: Enhancers are key cis-function elements of DNA structure that are crucial in gene regulation and the function of a promoter in eukaryotic cells. Availability of accurate identification of the enhancers would facilitate the understanding of DNA functions and their physiological roles. Previous studies have revealed the effectiveness of computational methods for identifying enhancers in other organisms. To date, a huge number of enhancers remain unknown, especially in the field of plant species. Objective: In this study, the aim is to build an efficient attention-based neural network model for the identification of Arabidopsis thaliana enhancers. Method: A sequence-based model using convolutional and recurrent neural networks was proposed for the identification of enhancers. The input DNA sequences are represented as feature vectors by 4-mer. A neural network model consists of CNN and Bi-RNN as sequence feature extractors, and the attention mechanism is suggested to improve the prediction performance. Results: We implemented an ablation study on validation set to select and evaluate the effectiveness of our proposed model. Moreover, our model showed remarkable performance on the test set achieving the Mcc of 0.955, the AUPRC of 0.638, and the AUROC of 0.837, which are significantly higher than state-of-the-art methods, respectively. Conclusion: The proposed computational framework aims at solving similar problems in non-coding genomic regions, thereby providing valuable insights into the prediction about the enhancers of plants.
科研通智能强力驱动
Strongly Powered by AbleSci AI