计算机科学
蛋白质结构预测
梯度下降
蛋白质结构
构造(python库)
人工神经网络
人工智能
简单(哲学)
算法
机器学习
蛋白质超家族
功能(生物学)
计算生物学
生物系统
卡斯普
数据挖掘
生物
遗传学
认识论
基因
哲学
程序设计语言
生物化学
作者
Andrew Senior,Richard Evans,John Jumper,James Kirkpatrick,Laurent Sifre,Tim Green,Chongli Qin,Augustin Žídek,Alexander Nelson,Alex Bridgland,Hugo Penedones,Stig Petersen,Karen Simonyan,Steve Crossan,Pushmeet Kohli,David T. Jones,David Silver,Koray Kavukcuoglu,Demis Hassabis
出处
期刊:Nature
[Springer Nature]
日期:2020-01-15
卷期号:577 (7792): 706-710
被引量:2697
标识
DOI:10.1038/s41586-019-1923-7
摘要
Protein structure prediction can be used to determine the three-dimensional shape of a protein from its amino acid sequence1. This problem is of fundamental importance as the structure of a protein largely determines its function2; however, protein structures can be difficult to determine experimentally. Considerable progress has recently been made by leveraging genetic information. It is possible to infer which amino acid residues are in contact by analysing covariation in homologous sequences, which aids in the prediction of protein structures3. Here we show that we can train a neural network to make accurate predictions of the distances between pairs of residues, which convey more information about the structure than contact predictions. Using this information, we construct a potential of mean force4 that can accurately describe the shape of a protein. We find that the resulting potential can be optimized by a simple gradient descent algorithm to generate structures without complex sampling procedures. The resulting system, named AlphaFold, achieves high accuracy, even for sequences with fewer homologous sequences. In the recent Critical Assessment of Protein Structure Prediction5 (CASP13)—a blind assessment of the state of the field—AlphaFold created high-accuracy structures (with template modelling (TM) scores6 of 0.7 or higher) for 24 out of 43 free modelling domains, whereas the next best method, which used sampling and contact information, achieved such accuracy for only 14 out of 43 domains. AlphaFold represents a considerable advance in protein-structure prediction. We expect this increased accuracy to enable insights into the function and malfunction of proteins, especially in cases for which no structures for homologous proteins have been experimentally determined7. AlphaFold predicts the distances between pairs of residues, is used to construct potentials of mean force that accurately describe the shape of a protein and can be optimized with gradient descent to predict protein structures.
科研通智能强力驱动
Strongly Powered by AbleSci AI