计算机科学
人工智能
编码
人工神经网络
深度学习
编码(内存)
特征(语言学)
鉴定(生物学)
DNA
支持向量机
DNA测序
机器学习
计算生物学
5-甲基胞嘧啶
序列(生物学)
模式识别(心理学)
生物
遗传学
基因
DNA甲基化
语言学
哲学
植物
基因表达
标识
DOI:10.1109/isbp57705.2023.10061304
摘要
DNA modification is closely related to the expression genetics of many organisms, therefore, the prediction of DNA modification sites is particularly important. In this paper, we use deep learning techniques to identify and predict DNA N4-methylcytosine modification sites, and the main work is as follows. Feature encoding using k-spacer nucleic acids to encode a 41 bp long DNA sequence as a (41×9) dimensional vector. Recognition prediction based on multi-headed attention mechanism and GRU neural network. Firstly, the encoded data are extracted and downscaled; secondly, the importance distribution of 4mc loci and each nucleotide in the sequence are further extracted adaptively using the multi-headed attention mechanism; then the GRU network is used to capture the long dependencies in the whole importance distribution; finally, a new prediction model of 4mc loci is constructed using two fully connected layers, and its recognition accuracy is significantly improved compared with other basic machine learning models. The recognition accuracy is improved compared with other basic machine learning models.
科研通智能强力驱动
Strongly Powered by AbleSci AI