
Deep convolutional neural networks for predicting leukemia-related transcription factor binding sites from DNA sequence data

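Keywords: Convolutional neural network; DNA binding site; Support vector machine; Artificial intelligence; Computer science; Transcription factor; Deep learning; Artificial neural network; Machine learning; Random forest; Pattern recognition (psychology); Computational biology; Gene; Biology; Promoter; Genetics; Gene expression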
Authors
Jian He, Xuemei Pu, Menglong Li, Chuan Li, Yanzhi Guo
Source
Journal: Chemometrics and Intelligent Laboratory Systems [Elsevier BV]
Volume/Issue: 199: 103976-103976 Citations: 7
Identifier
DOI: 10.1016/j.chemolab.2020.103976
Abstract

Transcription factors are proteins that bind to specific DNA sequences to regulate gene expression. Identifying transcription factor binding sites in DNA sequences is important for building regulatory models of biological systems and for identifying pathogenic variations. Traditional machine-learning methods have been used successfully for biological prediction problems based on DNA or protein sequences, but they all require manual extraction of numerical features, which is not only tedious but can also discard useful information in the primary sequence. In this paper, based on the principles of deep learning (DL), we constructed prediction models for transcription factor binding sites using only the original DNA base sequences. A DL method based on a convolutional neural network (CNN) and long short-term memory (LSTM) is proposed to investigate four leukemia categories from the perspective of transcription factor binding sites, using four large non-redundant datasets for acute, chronic, myeloid and lymphatic leukemia, respectively. Compared with three widely used machine-learning methods, namely artificial neural network (ANN), support vector machine (SVM) and random forest (RF), our DL method performs significantly better: the prediction accuracies of the three machine-learning models, whether based on sequence features or on k-mer feature extraction, are all lower than that of the DL model. The DL models for the four leukemia categories give an average prediction accuracy of 75% based only on sequence segments of 101 bases, which indicates that the DL-based method is promising and has advantages over traditional machine-learning methods. For leukemia-related transcription factor binding site prediction, further improvements, such as optimizing the base segment length and the CNN architecture, could be made to raise the current prediction accuracy.
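
The abstract contrasts two kinds of input: raw base sequences fed directly to the DL model, and hand-built k-mer features used by the ANN, SVM and RF baselines. The sketch below illustrates that difference only. The abstract does not state the actual encoding, so the one-hot scheme, the A/C/G/T column order, the handling of unknown bases, the choice of k = 3 and the function names `one_hot` and `kmer_counts` are all assumptions made for illustration, not the authors' pipeline.

```python
from itertools import product

BASES = "ACGT"  # assumed column order for the one-hot encoding


def one_hot(seq):
    """Encode a DNA string as one 4-element row per base (A, C, G, T).

    Assumption: bases outside ACGT (for example N) become an all-zero row.
    """
    index = {base: i for i, base in enumerate(BASES)}
    rows = []
    for base in seq.upper():
        row = [0, 0, 0, 0]
        if base in index:
            row[index[base]] = 1
        rows.append(row)
    return rows


def kmer_counts(seq, k=3):
    """Count overlapping k-mers and return a fixed-length vector of 4**k counts.

    Windows containing a base outside ACGT are skipped.
    """
    kmers = ["".join(p) for p in product(BASES, repeat=k)]
    counts = dict.fromkeys(kmers, 0)
    seq = seq.upper()
    for i in range(len(seq) - k + 1):
        window = seq[i:i + k]
        if window in counts:
            counts[window] += 1
    return [counts[kmer] for kmer in kmers]


# Hypothetical 101-base segment, matching the segment length in the abstract.
segment = "ACGTN" * 20 + "A"
encoded = one_hot(segment)               # 101 rows x 4 columns, position kept
features = kmer_counts(segment, k=3)     # 64 counts, position discarded
print(len(encoded), len(encoded[0]), len(features))  # prints: 101 4 64
```

The one-hot matrix keeps the position of every base, which is the kind of input a CNN or LSTM layer can scan, whereas the k-mer vector only records how often each short word occurs.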
