GraphLncLoc: long non-coding RNA subcellular localization prediction using graph convolutional networks based on sequence to graph transformation

德布鲁因图 计算机科学 编码 德布鲁恩序列 图形 卷积神经网络 深度学习 人工智能 模式识别(心理学) 理论计算机科学 生物 数学 遗传学 基因 组合数学
作者
Min Li,Baoying Zhao,Rui Yin,Chengqian Lu,Fei Guo,Min Zeng
出处
期刊:Briefings in Bioinformatics [Oxford University Press]
卷期号:24 (1) 被引量:5
标识
DOI:10.1093/bib/bbac565
摘要

The subcellular localization of long non-coding RNAs (lncRNAs) is crucial for understanding lncRNA functions. Most of existing lncRNA subcellular localization prediction methods use k-mer frequency features to encode lncRNA sequences. However, k-mer frequency features lose sequence order information and fail to capture sequence patterns and motifs of different lengths. In this paper, we proposed GraphLncLoc, a graph convolutional network-based deep learning model, for predicting lncRNA subcellular localization. Unlike previous studies encoding lncRNA sequences by using k-mer frequency features, GraphLncLoc transforms lncRNA sequences into de Bruijn graphs, which transforms the sequence classification problem into a graph classification problem. To extract the high-level features from the de Bruijn graph, GraphLncLoc employs graph convolutional networks to learn latent representations. Then, the high-level feature vectors derived from de Bruijn graph are fed into a fully connected layer to perform the prediction task. Extensive experiments show that GraphLncLoc achieves better performance than traditional machine learning models and existing predictors. In addition, our analyses show that transforming sequences into graphs has more distinguishable features and is more robust than k-mer frequency features. The case study shows that GraphLncLoc can uncover important motifs for nucleus subcellular localization. GraphLncLoc web server is available at http://csuligroup.com:8000/GraphLncLoc/.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
二二完成签到,获得积分10
2秒前
3秒前
学术小天才完成签到 ,获得积分10
4秒前
mildJYY完成签到,获得积分10
4秒前
GD88发布了新的文献求助10
4秒前
4秒前
秋风细细雨完成签到,获得积分10
5秒前
小小发布了新的文献求助10
5秒前
lff完成签到,获得积分10
6秒前
6秒前
雷大帅发布了新的文献求助10
8秒前
VitoLi发布了新的文献求助10
9秒前
9秒前
10秒前
11秒前
GD88发布了新的文献求助20
11秒前
韦涔完成签到,获得积分10
12秒前
12秒前
鹤卿完成签到,获得积分10
13秒前
14秒前
蓝天发布了新的文献求助30
14秒前
fann完成签到,获得积分10
15秒前
NexusExplorer应助陈曦读研版采纳,获得10
15秒前
车厘子发布了新的文献求助10
15秒前
雷大帅完成签到,获得积分10
16秒前
矿泉水发布了新的文献求助10
16秒前
18秒前
20秒前
20秒前
20秒前
Ava应助zzz采纳,获得10
22秒前
22秒前
科研通AI6.2应助LBM采纳,获得10
22秒前
22秒前
orixero应助科研通管家采纳,获得10
22秒前
慕青应助科研通管家采纳,获得10
22秒前
Hello应助科研通管家采纳,获得10
22秒前
CipherSage应助科研通管家采纳,获得10
22秒前
Orange应助科研通管家采纳,获得10
22秒前
22秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Cowries - A Guide to the Gastropod Family Cypraeidae 1200
Quality by Design - An Indispensable Approach to Accelerate Biopharmaceutical Product Development 800
Pulse width control of a 3-phase inverter with non sinusoidal phase voltages 777
Signals, Systems, and Signal Processing 610
A Social and Cultural History of the Hellenistic World 500
Chemistry and Physics of Carbon Volume 15 500
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6397542
求助须知:如何正确求助?哪些是违规求助? 8212928
关于积分的说明 17401464
捐赠科研通 5450944
什么是DOI,文献DOI怎么找? 2881170
邀请新用户注册赠送积分活动 1857682
关于科研通互助平台的介绍 1699724