生成语法
蛋白质设计
合成生物学
蛋白质二级结构
蛋白质工程
计算生物学
生成模型
蛋白质结构预测
蛋白质结构
计算机科学
机器学习
人工智能
生物
生物化学
酶
作者
Bo Ni,David L. Kaplan,Markus J. Buehler
出处
期刊:Chem
[Elsevier]
日期:2023-07-01
卷期号:9 (7): 1828-1849
被引量:30
标识
DOI:10.1016/j.chempr.2023.03.020
摘要
We report two generative deep learning models that predict amino acid sequences and 3D protein structures based on secondary structure design objectives via either overall content or per-residue structure. Both models are robust regarding imperfect inputs and offer de novo design capacity as they can discover new protein sequences not yet discovered from natural mechanisms or systems. The residue-level secondary structure design model generally yields higher accuracy and more diverse sequences. These findings suggest unexplored opportunities for protein designs and functional outcomes within the vast amino acid sequences beyond known proteins. Our models, based on an attention-based diffusion model and trained on a dataset extracted from experimentally known 3D protein structures, offer numerous downstream applications in conditional generative design of various biological or engineering systems. Future work may include additional conditioning, and an exploration of other functional properties of the generated proteins for various properties beyond structural objectives.
科研通智能强力驱动
Strongly Powered by AbleSci AI