人工智能
机器学习
计算机科学
超参数
特征选择
降维
支持向量机
特征(语言学)
维数之咒
蛋白质结构预测
蛋白质功能预测
功能(生物学)
人工神经网络
蛋白质功能
蛋白质结构
生物
基因
哲学
进化生物学
生物化学
语言学
作者
Rosalin Bonetta,Gianluca Valentino
出处
期刊:Proteins
[Wiley]
日期:2019-10-11
卷期号:88 (3): 397-413
被引量:119
摘要
Abstract Proteins play important roles in living organisms, and their function is directly linked with their structure. Due to the growing gap between the number of proteins being discovered and their functional characterization (in particular as a result of experimental limitations), reliable prediction of protein function through computational means has become crucial. This paper reviews the machine learning techniques used in the literature, following their evolution from simple algorithms such as logistic regression to more advanced methods like support vector machines and modern deep neural networks. Hyperparameter optimization methods adopted to boost prediction performance are presented. In parallel, the metamorphosis in the features used by these algorithms from classical physicochemical properties and amino acid composition, up to text‐derived features from biomedical literature and learned feature representations using autoencoders, together with feature selection and dimensionality reduction techniques, are also reviewed. The success stories in the application of these techniques to both general and specific protein function prediction are discussed.
科研通智能强力驱动
Strongly Powered by AbleSci AI