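Pairwise comparison
Computer science
Inference
Source code
Algorithm
Encoder
Nucleic acid secondary structure
Data mining
Artificial intelligence
RNA
Biochemistry
Chemistry
Gene
Operating system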
Authors
Enbin Yang,Hao Zhang,Zinan Zang,Zhiyong Zhou,Shuo Wang,Zhen Liu,Yuanning Liu
Identifier
DOI:10.1016/j.compbiomed.2023.107246
Abstract
RNA secondary structure is essential for predicting tertiary structure and understanding RNA function. Recent research tends to stack numerous modules to build large deep-learning models. This can raise accuracy above 70%, but it also incurs significant training costs and reduces prediction efficiency. We propose GCNfold, a model with three feature extractors. The Structure Extractor uses a three-layer Graph Convolutional Network (GCN) to mine structural information of RNA, such as stems, hairpins, and internal loops. Structure and Sequence Fusion embeds the structural information into sequences with Transformer encoders. The Long-distance Dependency Extractor captures long-range pairwise relationships with a UNet. Experiments indicate that, among all models with over 80% accuracy, GCNfold has a small number of parameters, a fast inference speed, and high accuracy. Additionally, GCNfold-Small takes only 90 ms to infer an RNA secondary structure and achieves close to 90% accuracy on average. The GCNfold code is available on GitHub at https://github.com/EnbinYang/GCNfold.
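The abstract does not spell out how the three-layer GCN is built, so the following is only a minimal sketch of what stacking three graph-convolution layers over an RNA graph could look like. It assumes the widely used propagation rule ReLU(Â H W) with Â = D^{-1/2}(A + I)D^{-1/2}, one-hot nucleotide features, and a graph made of backbone edges plus candidate base-pair edges. All function names, the hidden size, and the random (untrained) weights are hypothetical and are not taken from the GCNfold repository.

```python
import math
import random

NUCLEOTIDES = "ACGU"


def one_hot(seq):
    """Encode each nucleotide as a 4-dimensional one-hot row."""
    return [[1.0 if base == n else 0.0 for n in NUCLEOTIDES] for base in seq]


def normalized_adjacency(n, edges):
    """Return D^{-1/2} (A + I) D^{-1/2} for an undirected graph on n nodes."""
    # Start from the identity so every node has a self-loop.
    a = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    for i, j in edges:
        a[i][j] = a[j][i] = 1.0
    deg = [sum(row) for row in a]
    return [[a[i][j] / math.sqrt(deg[i] * deg[j]) for j in range(n)]
            for i in range(n)]


def matmul(x, y):
    """Plain dense matrix product on nested lists."""
    return [[sum(p * q for p, q in zip(row, col)) for col in zip(*y)]
            for row in x]


def gcn_layer(a_hat, h, w):
    """One propagation step: ReLU(A_hat @ H @ W)."""
    z = matmul(matmul(a_hat, h), w)
    return [[max(0.0, v) for v in row] for row in z]


def three_layer_gcn(seq, pair_edges, hidden=8, seed=0):
    """Run three stacked GCN layers with random weights (illustration only)."""
    rng = random.Random(seed)
    n = len(seq)
    # Assumption: graph edges are backbone neighbours plus candidate pairs.
    backbone = [(i, i + 1) for i in range(n - 1)]
    a_hat = normalized_adjacency(n, backbone + list(pair_edges))
    h = one_hot(seq)
    dims = [len(NUCLEOTIDES), hidden, hidden, hidden]
    for d_in, d_out in zip(dims, dims[1:]):
        w = [[rng.uniform(-0.5, 0.5) for _ in range(d_out)]
             for _ in range(d_in)]
        h = gcn_layer(a_hat, h, w)
    return h


# A toy hairpin: three G-C pairs closing an AAA loop.
embeddings = three_layer_gcn("GGGAAACCC", [(0, 8), (1, 7), (2, 6)])
```

Each of the nine nucleotides ends up with an 8-dimensional vector that mixes information from nodes up to three hops away in the graph, which is the intuition behind using a few GCN layers to pick up local motifs such as stems and loops. In a trained model the weights would be learned rather than sampled.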