伪氨基酸组成
马修斯相关系数
计算生物学
分类器(UML)
氨基酸
人工智能
刀切重采样
赖氨酸
支持向量机
化学
计算机科学
机器学习
生物
生物化学
数学
估计员
统计
二肽
标识
DOI:10.1016/j.jmgm.2017.08.020
摘要
As one of the most important and common histones post-translational modifications, crotonylation plays a key role in regulating various biological processes. The accurate identification of crotonylation sites is crucial to elucidate the underlying molecular mechanisms of crotonylation. In this study, a novel bioinformatics tool named CKSAAP_CrotSite is developed to predict crotonylation sites. The highlight of CKSAAP_CrotSite is to adopt the composition of k-spaced amino acid pairs as input encoding, and the support vector machine is employed as the classifier. As illustrated by jackknife test, CKSAAP_CrotSite achieves a promising performance with a Sensitivity of 92.45%, a Specificity of 99.17%, an Accuracy of 98.11% and a Matthew’s correlation coefficient of 0.9283, which is much better than those of the existing prediction methods. Feature analysis shows that some amino acid pairs such as ‘KxG’, ‘KG’ and ‘PxP’ may play an important role in the prediction of crotonylation sites. The results of analysis and prediction could offer useful information for elucidating the molecular mechanisms of crotonylation and related experimental validations. A user-friendly web-server for CKSAAP_CrotSite is available at 123.206.31.171/CKSAAP_CrotSite/.
科研通智能强力驱动
Strongly Powered by AbleSci AI