计算机科学
水准点(测量)
人工智能
特征(语言学)
源代码
深度学习
鉴定(生物学)
编码(社会科学)
机器学习
数据挖掘
生物
数学
哲学
语言学
植物
统计
大地测量学
地理
操作系统
作者
Jianhua Jia,Rufeng Lei,Lulu Qin,Xin Wei
出处
期刊:BMC Genomics
[Springer Nature]
日期:2024-03-05
卷期号:25 (1)
标识
DOI:10.1186/s12864-024-10154-z
摘要
Abstract Background 5-Methylcytosine (5mC) plays a very important role in gene stability, transcription, and development. Therefore, accurate identification of the 5mC site is of key importance in genetic and pathological studies. However, traditional experimental methods for identifying 5mC sites are time-consuming and costly, so there is an urgent need to develop computational methods to automatically detect and identify these 5mC sites. Results Deep learning methods have shown great potential in the field of 5mC sites, so we developed a deep learning combinatorial model called i5mC-DCGA. The model innovatively uses the Convolutional Block Attention Module (CBAM) to improve the Dense Convolutional Network (DenseNet), which is improved to extract advanced local feature information. Subsequently, we combined a Bidirectional Gated Recurrent Unit (BiGRU) and a Self-Attention mechanism to extract global feature information. Our model can learn feature representations of abstract and complex from simple sequence coding, while having the ability to solve the sample imbalance problem in benchmark datasets. The experimental results show that the i5mC-DCGA model achieves 97.02%, 96.52%, 96.58% and 85.58% in sensitivity (Sn), specificity (Sp), accuracy (Acc) and matthews correlation coefficient (MCC), respectively. Conclusions The i5mC-DCGA model outperforms other existing prediction tools in predicting 5mC sites, and it is currently the most representative promoter 5mC site prediction tool. The benchmark dataset and source code for the i5mC-DCGA model can be found in https://github.com/leirufeng/i5mC-DCGA .
科研通智能强力驱动
Strongly Powered by AbleSci AI