序列空间
计算生物学
蛋白质测序
生成语法
人工智能
肽序列
功能(生物学)
蛋白质家族
蛋白质工程
生物
序列标志
序列比对
序列(生物学)
计算机科学
遗传学
生物化学
酶
基因
数学
巴拿赫空间
纯数学
作者
Donatas Repecka,Vykintas Jauniškis,Laurynas Karpus,Elzbieta Rembeza,Irmantas Rokaitis,Jan Zrimec,Simona Povilonienė,Audrius Laurynėnas,Sandra Viknander,Wissam Abuajwa,Otto Savolainen,Rolandas Meškys,Martin K. M. Engqvist,Aleksej Zelezniak
标识
DOI:10.1038/s42256-021-00310-5
摘要
De novo protein design for catalysis of any desired chemical reaction is a long-standing goal in protein engineering because of the broad spectrum of technological, scientific and medical applications. However, mapping protein sequence to protein function is currently neither computationally nor experimentally tangible. Here, we develop ProteinGAN, a self-attention-based variant of the generative adversarial network that is able to ‘learn’ natural protein sequence diversity and enables the generation of functional protein sequences. ProteinGAN learns the evolutionary relationships of protein sequences directly from the complex multidimensional amino-acid sequence space and creates new, highly diverse sequence variants with natural-like physical properties. Using malate dehydrogenase (MDH) as a template enzyme, we show that 24% (13 out of 55 tested) of the ProteinGAN-generated and experimentally tested sequences are soluble and display MDH catalytic activity in the tested conditions in vitro, including a highly mutated variant of 106 amino-acid substitutions. ProteinGAN therefore demonstrates the potential of artificial intelligence to rapidly generate highly diverse functional proteins within the allowed biological constraints of the sequence space. A protein’s three-dimensional structure and properties are defined by its amino-acid sequence, but mapping protein sequence to protein function is a computationally highly intensive task. A new generative adversarial network approach learns from natural protein sequences and generates new, diverse protein sequence variations, which are experimentally tested.
科研通智能强力驱动
Strongly Powered by AbleSci AI