Secondary structure prediction of long noncoding RNA: review and experimental comparison of existing approaches

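Keywords: Computer science; Benchmark (surveying); Source code; Metric (unit); Nucleic acid secondary structure; Machine learning; Artificial intelligence; Function (biology); Protein secondary structure; RNA; Nucleic acid structure; Computational biology; Data mining; Biology; Engineering; Biochemistry; Operations management; Geodesy; Evolutionary biology; Gene; Geography; Operating system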
Authors
Leandro A. Bugnon, Alejandro A. Edera, Santiago Prochetto, M. Gérard, Jonathan Raad, Emilio Fenoy, María Florencia Rubiolo, Uciel Chorostecki, Toni Gabaldón, Federico Ariel, Leandro E. Di Persia, Diego H. Milone, Georgina Stegmayer
Source
Journal: Briefings in Bioinformatics [Oxford University Press]
Volume (Issue): 23 (4) Citations: 9
Identifier
DOI:10.1093/bib/bbac205
Abstract

Motivation: In contrast to messenger RNAs, the function of the wide range of existing long noncoding RNAs (lncRNAs) largely depends on their structure, which determines interactions with partner molecules. Thus, determining or predicting the secondary structure of lncRNAs is critical to uncovering their function. Classical approaches for predicting RNA secondary structure have been based on dynamic programming and thermodynamic calculations. In the last 4 years, a growing number of machine learning (ML)-based models, including deep learning (DL), have achieved breakthrough performance in structure prediction of biomolecules such as proteins and have outperformed classical methods in folding short transcripts. Nevertheless, accurate prediction for lncRNAs remains far from solved. Notably, the myriad of new proposals has not been systematically and experimentally evaluated.

Results: In this work, we compare the performance of the classical methods as well as the most recently proposed approaches for secondary structure prediction of RNA sequences using a unified and consistent experimental setup. We use the publicly available structural profiles for 3023 yeast RNA sequences, and a novel benchmark of well-characterized lncRNA structures from different species. Moreover, we propose a novel metric to assess the predictive performance of methods, based exclusively on the chemical probing data commonly used for profiling RNA structures, which avoids any potential bias introduced by computational predictions when dot-bracket references are used. Our results provide a comprehensive comparative assessment of existing methodologies, and a novel and public benchmark resource to aid in the development and comparison of future approaches.

Availability: Full source code and benchmark datasets are available at https://github.com/sinc-lab/lncRNA-folding

Contact: lbugnon@sinc.unl.edu.ar
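For readers unfamiliar with the dot-bracket references mentioned in the abstract, the following is a minimal sketch of the conventional way a predicted structure is scored against such a reference: both strings are converted to sets of base pairs and compared with an F1 score. This is not the probing-based metric proposed in the paper, whose definition is not given in the abstract. The function names `dot_bracket_to_pairs` and `pair_f1` are hypothetical, and the sketch assumes plain `(`, `)`, `.` notation without pseudoknot brackets.

```python
def dot_bracket_to_pairs(structure):
    """Return the set of 0-based (i, j) base pairs encoded by a dot-bracket string.

    Assumption: only '(' , ')' and '.' carry meaning; any other symbol is
    treated as unpaired (pseudoknot brackets are not handled).
    """
    stack = []
    pairs = set()
    for idx, symbol in enumerate(structure):
        if symbol == "(":
            stack.append(idx)
        elif symbol == ")":
            if not stack:
                raise ValueError(f"unbalanced ')' at position {idx}")
            pairs.add((stack.pop(), idx))
    if stack:
        raise ValueError(f"unbalanced '(' at position {stack[-1]}")
    return pairs


def pair_f1(reference, predicted):
    """F1 score over base pairs shared by a reference and a predicted structure."""
    if len(reference) != len(predicted):
        raise ValueError("structures must have the same length")
    ref_pairs = dot_bracket_to_pairs(reference)
    pred_pairs = dot_bracket_to_pairs(predicted)
    if not ref_pairs and not pred_pairs:
        return 1.0  # both fully unpaired: treated here as perfect agreement
    true_positives = len(ref_pairs & pred_pairs)
    if true_positives == 0:
        return 0.0
    precision = true_positives / len(pred_pairs)
    recall = true_positives / len(ref_pairs)
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    # Toy 6-nt example: the prediction recovers one of the two reference pairs.
    print(pair_f1("((..))", "(....)"))
```

Because a dot-bracket reference is often itself produced or refined by a folding program, a score of this kind can inherit that program's bias, which is the concern the abstract raises when motivating a metric computed directly from probing data.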
