马尔可夫链
计算机科学
不变(物理)
生成模型
统计物理学
条件概率分布
扩散过程
生成语法
算法
人工智能
数学
机器学习
物理
统计
知识管理
数学物理
创新扩散
作者
Minkai Xu,Lantao Yu,Yang Song,Chence Shi,Stefano Ermon,Jian Tang
出处
期刊:Cornell University - arXiv
日期:2022-03-06
被引量:167
标识
DOI:10.48550/arxiv.2203.02923
摘要
Predicting molecular conformations from molecular graphs is a fundamental problem in cheminformatics and drug discovery. Recently, significant progress has been achieved with machine learning approaches, especially with deep generative models. Inspired by the diffusion process in classical non-equilibrium thermodynamics where heated particles will diffuse from original states to a noise distribution, in this paper, we propose a novel generative model named GeoDiff for molecular conformation prediction. GeoDiff treats each atom as a particle and learns to directly reverse the diffusion process (i.e., transforming from a noise distribution to stable conformations) as a Markov chain. Modeling such a generation process is however very challenging as the likelihood of conformations should be roto-translational invariant. We theoretically show that Markov chains evolving with equivariant Markov kernels can induce an invariant distribution by design, and further propose building blocks for the Markov kernels to preserve the desirable equivariance property. The whole framework can be efficiently trained in an end-to-end fashion by optimizing a weighted variational lower bound to the (conditional) likelihood. Experiments on multiple benchmarks show that GeoDiff is superior or comparable to existing state-of-the-art approaches, especially on large molecules.
科研通智能强力驱动
Strongly Powered by AbleSci AI