深度学习
计算生物学
人工智能
计算机科学
核糖核酸
生物
遗传学
基因
作者
Haoquan Liu,Yiren Jian,Chen Zeng,Yunjie Zhao
标识
DOI:10.1038/s42003-025-07694-9
摘要
Accurate computational determination of RNA-protein interactions remains challenging, particularly when encountering unknown RNAs and proteins. The limited number of RNAs and their flexibility constrained the effectiveness of the deep-learning models for RNA-protein interaction prediction. Here, we introduce ZHMolGraph, which integrates graph neural network and unsupervised large language models to predict RNA-protein interaction. We validate ZHMolGraph predictions on two benchmark datasets and outperform the current best methods. For the dataset of entirely unknown RNAs and proteins, ZHMolGraph shows an improvement in achieving high AUROC of 79.8% and AUPRC of 82.0%. This represents a substantial improvement of 7.1%–28.7% in AUROC and 4.6%–30.0% in AUPRC over other methods. We utilize ZHMolGraph to enhance the challenging SARS-CoV-2 RPI and unbound RNA-protein complex predictions. Such enhancements make ZHMolGraph a reliable option for genome-wide RNA-protein prediction. ZHMolGraph holds broad potential for modeling and designing RNA-protein complexes. This study has developed a new method to predict RNA-protein interactions by combining graph neural networks and unsupervised large language models. The model showed superior performances by benchmark tests, especially for previously unseen RNAs and proteins.
科研通智能强力驱动
Strongly Powered by AbleSci AI