蛋白质工程
人工智能
定向进化
计算机科学
机器学习
定向分子进化
功能(生物学)
序列(生物学)
主动学习(机器学习)
过程(计算)
特征工程
深度学习
生物
程序设计语言
基因
进化生物学
突变体
生物化学
酶
遗传学
作者
Kevin Yang,Zachary Wu,Frances H. Arnold
出处
期刊:Nature Methods
[Springer Nature]
日期:2019-07-15
卷期号:16 (8): 687-694
被引量:735
标识
DOI:10.1038/s41592-019-0496-6
摘要
Protein engineering through machine-learning-guided directed evolution enables the optimization of protein functions. Machine-learning approaches predict how sequence maps to function in a data-driven manner without requiring a detailed model of the underlying physics or biological pathways. Such methods accelerate directed evolution by learning from the properties of characterized variants and using that information to select sequences that are likely to exhibit improved properties. Here we introduce the steps required to build machine-learning sequence–function models and to use those models to guide engineering, making recommendations at each stage. This review covers basic concepts relevant to the use of machine learning for protein engineering, as well as the current literature and applications of this engineering paradigm. We illustrate the process with two case studies. Finally, we look to future opportunities for machine learning to enable the discovery of unknown protein functions and uncover the relationship between protein sequence and function. This review provides an overview of machine learning techniques in protein engineering and illustrates the underlying principles with the help of case studies.
科研通智能强力驱动
Strongly Powered by AbleSci AI