核糖核酸
计算机科学
人工智能
深度学习
计算生物学
核酸结构
核酸二级结构
机器学习
灵活性(工程)
概化理论
管道(软件)
生物
遗传学
数学
基因
统计
程序设计语言
作者
Tao Shen,Zhihang Hu,Siqi Sun,Di Liu,Felix Wong,Jiuming Wang,Jiayang Chen,Yixuan Wang,Liang Hong,Jin Xiao,Liangzhen Zheng,Tejas Krishnamoorthi,Irwin King,Sheng Wang,Peng Yin,James J. Collins,Yu Li
标识
DOI:10.1038/s41592-024-02487-0
摘要
Accurate prediction of RNA three-dimensional (3D) structures remains an unsolved challenge. Determining RNA 3D structures is crucial for understanding their functions and informing RNA-targeting drug development and synthetic biology design. The structural flexibility of RNA, which leads to the scarcity of experimentally determined data, complicates computational prediction efforts. Here we present RhoFold+, an RNA language model-based deep learning method that accurately predicts 3D structures of single-chain RNAs from sequences. By integrating an RNA language model pretrained on ~23.7 million RNA sequences and leveraging techniques to address data scarcity, RhoFold+ offers a fully automated end-to-end pipeline for RNA 3D structure prediction. Retrospective evaluations on RNA-Puzzles and CASP15 natural RNA targets demonstrate the superiority of RhoFold+ over existing methods, including human expert groups. Its efficacy and generalizability are further validated through cross-family and cross-type assessments, as well as time-censored benchmarks. Additionally, RhoFold+ predicts RNA secondary structures and interhelical angles, providing empirically verifiable features that broaden its applicability to RNA structure and function studies. RhoFold+ is an end-to-end language model-based deep learning method to predict RNA three-dimensional structures of single-chain RNAs from sequences.
科研通智能强力驱动
Strongly Powered by AbleSci AI