计算生物学
编码(社会科学)
人工神经网络
核糖核酸
计算机科学
非编码RNA
深度学习
深层神经网络
人工智能
长非编码RNA
基因
RNA序列
生物
机器学习
遗传学
基因表达
转录组
统计
数学
作者
Amira Kefi,Morris Chukhman,Vinayakumar Karintha,Sadok Bouamama,Jie Yang,Chunyu Liu
标识
DOI:10.1109/icbcb57893.2023.10246468
摘要
Recent advent of the second and third generation of sequencing has uncovered many novel transcripts. These novel transcripts could have crucial functions in different biological processes and might be related to challenging diseases and pathogenesis. However, whether these genes should be classified as protein coding RNAs (pcRNAs) or long non-coding RNAs (lncRNAs) is still debated and unclear. In this study we propose a coding potential classification framework based on deep neural networks and novel features from RNA-seq and Ribo-seq data to classify RNAs transcripts into protein coding and long non coding. As far as we know, this is the first method that uses RNA-seq and Ribo-seq as predictors to classify RNAs using a deep neural network model. Compared to other methods, the prediction of our method reached 97.4% accuracy.
科研通智能强力驱动
Strongly Powered by AbleSci AI