RNAcmap: a fully automatic pipeline for predicting contact maps of RNAs by evolutionary coupling analysis

计算机科学 管道(软件) 核糖核酸 计算生物学 核酸结构 核酸二级结构 蛋白质二级结构 假结 折叠(DSP实现) 数据挖掘 生物信息学 生物 遗传学 基因 生物化学 程序设计语言 工程类 电气工程
作者
Tongchuan Zhang,Jaswinder Singh,Thomas Litfin,Jian Zhan,Kuldip K. Paliwal,Yaoqi Zhou
出处
期刊:Bioinformatics [Oxford University Press]
卷期号:37 (20): 3494-3500 被引量:26
标识
DOI:10.1093/bioinformatics/btab391
摘要

The accuracy of RNA secondary and tertiary structure prediction can be significantly improved by using structural restraints derived from evolutionary coupling or direct coupling analysis. Currently, these coupling analyses relied on manually curated multiple sequence alignments collected in the Rfam database, which contains 3016 families. By comparison, millions of non-coding RNA sequences are known. Here, we established RNAcmap, a fully automatic pipeline that enables evolutionary coupling analysis for any RNA sequences. The homology search was based on the covariance model built by INFERNAL according to two secondary structure predictors: a folding-based algorithm RNAfold and the latest deep-learning method SPOT-RNA.We showed that the performance of RNAcmap is less dependent on the specific evolutionary coupling tool but is more dependent on the accuracy of secondary structure predictor with the best performance given by RNAcmap (SPOT-RNA). The performance of RNAcmap (SPOT-RNA) is comparable to that based on Rfam-supplied alignment and consistent for those sequences that are not in Rfam collections. Further improvement can be made with a simple meta predictor RNAcmap (SPOT-RNA/RNAfold) depending on which secondary structure predictor can find more homologous sequences. Reliable base-pairing information generated from RNAcmap, for RNAs with high effective homologous sequences, in particular, will be useful for aiding RNA structure prediction.RNAcmap is available as a web server at https://sparks-lab.org/server/rnacmap/ and as a standalone application along with the datasets at https://github.com/sparks-lab-org/RNAcmap_standalone. A platform independent and fully configured docker image of RNAcmap is also provided at https://hub.docker.com/r/jaswindersingh2/rnacmap.Supplementary data are available at Bioinformatics online.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
刚刚
1秒前
心语发布了新的文献求助10
2秒前
2秒前
甜甜青文发布了新的文献求助10
4秒前
HHUMLH完成签到 ,获得积分10
5秒前
6秒前
fengbeing发布了新的文献求助10
7秒前
7秒前
张KT完成签到,获得积分10
7秒前
清爽聋五发布了新的文献求助10
10秒前
msp发布了新的文献求助10
11秒前
甜美幻露发布了新的文献求助10
12秒前
隐形的若灵完成签到 ,获得积分10
13秒前
14秒前
jiajiajiamin发布了新的文献求助10
14秒前
江洋大盗发布了新的文献求助10
15秒前
pterionGao完成签到 ,获得积分10
15秒前
xngrass完成签到,获得积分10
15秒前
16秒前
17秒前
嘿嘿完成签到,获得积分10
18秒前
18秒前
Allez完成签到,获得积分10
19秒前
19秒前
guolingge发布了新的文献求助30
20秒前
22秒前
22秒前
搞怪诗珊发布了新的文献求助30
22秒前
活泼雁芙发布了新的文献求助10
22秒前
李健的小迷弟应助nnnnn采纳,获得10
23秒前
汉堡包应助Kaikai采纳,获得10
25秒前
邹广浩发布了新的文献求助10
28秒前
fengbeing完成签到,获得积分10
28秒前
lzs完成签到,获得积分10
29秒前
白云发布了新的文献求助10
29秒前
31秒前
31秒前
33秒前
ck0124完成签到 ,获得积分10
35秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
PowerCascade: A Synthetic Dataset for Cascading Failure Analysis in Power Systems 2000
Various Faces of Animal Metaphor in English and Polish 800
Signals, Systems, and Signal Processing 610
Unlocking Chemical Thinking: Reimagining Chemistry Teaching and Learning 555
Mass participant sport event brand associations: an analysis of two event categories 500
Photodetectors: From Ultraviolet to Infrared 500
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6354926
求助须知:如何正确求助?哪些是违规求助? 8170080
关于积分的说明 17198757
捐赠科研通 5410900
什么是DOI,文献DOI怎么找? 2864148
邀请新用户注册赠送积分活动 1841694
关于科研通互助平台的介绍 1690148