序列(生物学)
计算机科学
概率逻辑
生成模型
生成语法
任务(项目管理)
生物系统
拓扑(电路)
计算生物学
算法
人工智能
生物
数学
工程类
遗传学
组合数学
系统工程
作者
Namrata Anand,Tudor Achim
出处
期刊:Cornell University - arXiv
日期:2022-01-01
被引量:58
标识
DOI:10.48550/arxiv.2205.15019
摘要
Proteins are macromolecules that mediate a significant fraction of the cellular processes that underlie life. An important task in bioengineering is designing proteins with specific 3D structures and chemical properties which enable targeted functions. To this end, we introduce a generative model of both protein structure and sequence that can operate at significantly larger scales than previous molecular generative modeling approaches. The model is learned entirely from experimental data and conditions its generation on a compact specification of protein topology to produce a full-atom backbone configuration as well as sequence and side-chain predictions. We demonstrate the quality of the model via qualitative and quantitative analysis of its samples. Videos of sampling trajectories are available at https://nanand2.github.io/proteins .
科研通智能强力驱动
Strongly Powered by AbleSci AI