蛋白质工程
蛋白质设计
计算机科学
突变
突变蛋白
计算生物学
比例(比率)
突变体
机器学习
蛋白质结构
生物
遗传学
生物化学
基因
物理
量子力学
酶
作者
Tianshu Wang,Xin Jin,Xiaoli Lü,Xiaoping Min,Shuping Ge,Shaowei Li
标识
DOI:10.3389/fgene.2023.1347667
摘要
Introduction: Protein engineering, which aims to improve the properties and functions of proteins, holds great research significance and application value. However, current models that predict the effects of amino acid substitutions often perform poorly when evaluated for precision. Recent research has shown that ProteinMPNN, a large-scale pre-training sequence design model based on protein structure, performs exceptionally well. It is capable of designing mutants with structures similar to the original protein. When applied to the field of protein engineering, the diverse designs for mutation positions generated by this model can be viewed as a more precise mutation range. Methods: We collected three biological experimental datasets and compared the design results of ProteinMPNN for wild-type proteins with the experimental datasets to verify the ability of ProteinMPNN in improving protein fitness. Results: The validation on biological experimental datasets shows that ProteinMPNN has the ability to design mutation types with higher fitness in single and multi-point mutations. We have verified the high accuracy of ProteinMPNN in protein engineering tasks from both positive and negative perspectives. Discussion: Our research indicates that using large-scale pre trained models to design protein mutants provides a new approach for protein engineering, providing strong support for guiding biological experiments and applications in biotechnology.
科研通智能强力驱动
Strongly Powered by AbleSci AI