嗜热菌
热稳定性
刀切重采样
特征选择
中层
计算生物学
蛋白质工程
生物化学
人类蛋白质
支持向量机
氨基酸
交叉验证
水准点(测量)
计算机科学
生物
化学
酶
机器学习
数学
遗传学
细菌
基因
统计
地理
估计员
大地测量学
标识
DOI:10.1016/j.mimet.2010.10.013
摘要
The thermostability of proteins is particularly relevant for enzyme engineering. Developing a computational method to identify mesophilic proteins would be helpful for protein engineering and design. In this work, we developed support vector machine based method to predict thermophilic proteins using the information of amino acid distribution and selected amino acid pairs. A reliable benchmark dataset including 915 thermophilic proteins and 793 non-thermophilic proteins was constructed for training and testing the proposed models. Results showed that 93.8% thermophilic proteins and 92.7% non-thermophilic proteins could be correctly predicted by using jackknife cross-validation. High predictive successful rate exhibits that this model can be applied for designing stable proteins.
科研通智能强力驱动
Strongly Powered by AbleSci AI