计算机科学
UniProt公司
卷积神经网络
人工智能
丝氨酸
深度学习
苏氨酸
循环神经网络
蛋白质磷酸化
模式识别(心理学)
磷酸化
机器学习
计算生物学
人工神经网络
生物
生物化学
基因
蛋白激酶A
作者
Haixia Long,Zhao Sun,Manzhi Li,Hai Fu,Ming Lin
出处
期刊:Current Bioinformatics
[Bentham Science]
日期:2019-10-17
卷期号:15 (4): 300-308
被引量:28
标识
DOI:10.2174/1574893614666190902154332
摘要
Background: Protein phosphorylation is one of the most important Post-translational Modifications (PTMs) occurring at amino acid residues serine (S), threonine (T), and tyrosine (Y). It plays critical roles in protein structure and function predicting. With the development of novel high-throughput sequencing technologies, there are a huge amount of protein sequences being generated and stored in databases. Objective: It is of great importance in both basic research and drug development to quickly and accurately predict which residues of S, T, or Y can be phosphorylated. Methods: In order to solve the problem, a novel hybrid deep learning model with a convolutional neural network and bi-directional long short-term memory recurrent neural network (CNN+BLSTM) is proposed for predicting phosphorylation sites in proteins. The model contains a list of layers that transform the input data into an output class, in which the convolution layer captures higher-level abstraction features of amino acid, while the recurrent layer captures long-term dependencies between amino acids to improve predictions. The joint model learns interactions between higher-level features derived from the protein sequence to predict the phosphorylated sites. Results: We applied our model together with two canonical methods namely iPhos-PseEn and MusiteDeep. A 5-fold cross-validation process indicated that CNN+BLSTM outperforms the two competitors in various evaluation metrics like the area under the receiver operating characteristic and precision-recall curves, the Matthews correlation coefficient, F-measure, accuracy, and so on. Conclusion: CNN+BLSTM is promising in identifying potential protein phosphorylation for further experimental validation.
科研通智能强力驱动
Strongly Powered by AbleSci AI