水准点(测量)
计算机科学
人工智能
蛋白质测序
排名(信息检索)
卷积神经网络
序列(生物学)
机器学习
深度学习
人工神经网络
亚细胞定位
数据挖掘
模式识别(心理学)
肽序列
生物
基因
遗传学
地理
大地测量学
作者
Zhijun Liao,Gaofeng Pan,Chao Sun,Jijun Tang
标识
DOI:10.1186/s12859-021-04404-0
摘要
Protein subcellular localization prediction plays an important role in biology research. Since traditional methods are laborious and time-consuming, many machine learning-based prediction methods have been proposed. However, most of the proposed methods ignore the evolution information of proteins. In order to improve the prediction accuracy, we present a deep learning-based method to predict protein subcellular locations.Our method utilizes not only amino acid compositions sequence but also evolution matrices of proteins. Our method uses a bidirectional long short-term memory network that processes the entire protein sequence and a convolutional neural network that extracts features from protein sequences. The position specific scoring matrix is used as a supplement to protein sequences. Our method was trained and tested on two benchmark datasets. The experiment results show that our method yields accurate results on the two datasets with an average precision of 0.7901, ranking loss of 0.0758 and coverage of 1.2848.The experiment results show that our method outperforms five methods currently available. According to those experiments, we can see that our method is an acceptable alternative to predict protein subcellular location.
科研通智能强力驱动
Strongly Powered by AbleSci AI