HyperMPNN ‒ A general strategy to design thermostable proteins learned from hyperthermophiles
超嗜热菌
蛋白质设计
化学
计算生物学
计算机科学
生物化学
生物
蛋白质结构
古细菌
基因
作者
Moritz Ertelt,Paul G. Schlegel,Max Beining,Leonard Kaysser,Jens Meiler,Clara T. Schoeder
标识
DOI:10.1101/2024.11.26.625397
摘要
Stability is a key factor to enable the use of recombinant proteins in therapeutic or biotechnological applications. Deep learning protein design approaches like ProteinMPNN have shown strong performance both in creating novel proteins or stabilizing existing ones. However, it is unlikely that the stability of the designs will significantly exceed that of the natural proteins in the training set, which are biophysically only marginally stable. Therefore, we collected predicted protein structures from hyperthermophiles, which differ substantially in their amino acid composition from mesophiles. Notably, ProteinMPNN fails to recover their unique amino acid composition. Here we show that a retrained network on predicted proteins from hyperthermophiles, termed HyperMPNN, not only recovers this unique amino acid composition but can also be applied to proteins from non-hyperthermophiles. Using this novel approach on a protein nanoparticle with a melting temperature of 65°C resulted in designs remaining stable at 95°C. In conclusion, we created a new way to design highly thermostable proteins through self-supervised learning on data from hyperthermophiles.