计算机科学
运动学
概率逻辑
噪音(视频)
扩散过程
采样(信号处理)
扩散
人工智能
运动捕捉
算法
联合概率分布
统计物理学
运动(物理)
计算机视觉
数学
物理
统计
知识管理
创新扩散
滤波器(信号处理)
经典力学
图像(数学)
热力学
作者
Wei Dong,Huaijiang Sun,Bin Li,Jianfeng Lu,Weiqing Li,Xiaoning Sun,Shengxiang Hu
出处
期刊:Proceedings of the ... AAAI Conference on Artificial Intelligence
[Association for the Advancement of Artificial Intelligence (AAAI)]
日期:2023-06-26
卷期号:37 (5): 6110-6118
被引量:3
标识
DOI:10.1609/aaai.v37i5.25754
摘要
Stochastic human motion prediction aims to forecast multiple plausible future motions given a single pose sequence from the past. Most previous works focus on designing elaborate losses to improve the accuracy, while the diversity is typically characterized by randomly sampling a set of latent variables from the latent prior, which is then decoded into possible motions. This joint training of sampling and decoding, however, suffers from posterior collapse as the learned latent variables tend to be ignored by a strong decoder, leading to limited diversity. Alternatively, inspired by the diffusion process in nonequilibrium thermodynamics, we propose MotionDiff, a diffusion probabilistic model to treat the kinematics of human joints as heated particles, which will diffuse from original states to a noise distribution. This process not only offers a natural way to obtain the "whitened'' latents without any trainable parameters, but also introduces a new noise in each diffusion step, both of which facilitate more diverse motions. Human motion prediction is then regarded as the reverse diffusion process that converts the noise distribution into realistic future motions conditioned on the observed sequence. Specifically, MotionDiff consists of two parts: a spatial-temporal transformer-based diffusion network to generate diverse yet plausible motions, and a flexible refinement network to further enable geometric losses and align with the ground truth. Experimental results on two datasets demonstrate that our model yields the competitive performance in terms of both diversity and accuracy.
科研通智能强力驱动
Strongly Powered by AbleSci AI