可扩展性
蛋白质设计
序列(生物学)
计算机科学
序列空间
空格(标点符号)
计算生物学
蛋白质结构
生物
数学
遗传学
离散数学
生物化学
数据库
操作系统
巴拿赫空间
作者
Christopher L. Frank,Ali Khoshouei,Lara Fuβ,Dominik Schiwietz,Dominik Putz,L W Weber,Zhixuan Zhao,Motoyuki Hattori,Shi-Hao Feng,Yosta de Stigter,Sergey Ovchinnikov,Hendrik Dietz
出处
期刊:Science
[American Association for the Advancement of Science (AAAS)]
日期:2024-10-24
卷期号:386 (6720): 439-445
标识
DOI:10.1126/science.adq1741
摘要
Machine learning (ML)–based design approaches have advanced the field of de novo protein design, with diffusion-based generative methods increasingly dominating protein design pipelines. Here, we report a “hallucination”-based protein design approach that functions in relaxed sequence space, enabling the efficient design of high-quality protein backbones over multiple scales and with broad scope of application without the need for any form of retraining. We experimentally produced and characterized more than 100 proteins. Three high-resolution crystal structures and two cryo–electron microscopy density maps of designed single-chain proteins comprising up to 1000 amino acids validate the accuracy of the method. Our pipeline can also be used to design synthetic protein-protein interactions, as validated experimentally by a set of protein heterodimers. Relaxed sequence optimization offers attractive performance with respect to designability, scope of applicability for different design problems, and scalability across protein sizes.
科研通智能强力驱动
Strongly Powered by AbleSci AI