蛋白质设计
序列(生物学)
跨膜蛋白
纳米孔
水准点(测量)
回路建模
计算生物学
计算机科学
蛋白质测序
蛋白质工程
功能(生物学)
蛋白质结构
肽序列
算法
生物
蛋白质结构预测
生物化学
纳米技术
酶
材料科学
遗传学
基因
受体
地理
大地测量学
作者
Marissa D Dolorfino,Anastassia Vorobieva
标识
DOI:10.1101/2024.01.16.575764
摘要
Abstract Recent deep-learning (DL) protein design methods have been successfully applied to a range of protein design problems including the de novo design of novel folds, protein binders, and enzymes. However, DL methods have yet to meet the challenge of de novo membrane protein (MP) and the design of complex β-sheet folds. We performed a comprehensive benchmark of one DL protein sequence design method, ProteinMPNN, using transmembrane and water-soluble β-barrel folds as a model, and compared the performance of ProteinMPNN to the new membrane-specific Rosetta Franklin2023 energy function. We tested the effect of input backbone refinement on ProteinMPNN performance and found that given refined and well-defined inputs, ProteinMPNN more accurately captures global sequence properties despite complex folding biophysics. It generates more diverse TMB sequences than Franklin2023 in pore-facing positions. In addition, ProteinMPNN generated TMB sequences that passed state-of-the-art in silico filters for experimental validation, suggesting that the model could be used in de novo design tasks of diverse nanopores for single-molecule sensing and sequencing. Lastly, our results indicate that the low success rate of ProteinMPNN for the design of β-sheet proteins stems from backbone input accuracy rather than software limitations.
科研通智能强力驱动
Strongly Powered by AbleSci AI