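Computer science
Artificial intelligence
Computer vision
Leverage (statistics)
Rendering (computer graphics)
Volume rendering
Animation
Ray casting
Computer animation
Computer graphics (images)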
Authors
Xiangjun Gao, Jiaolong Yang, Jongyoo Kim, Sida Peng, Zicheng Liu, Xin Tong
Identifier
DOI: 10.1109/tpami.2022.3205910
Abstract
There has been rapid progress recently on 3D human rendering, including novel view synthesis and pose animation, based on advances in neural radiance fields (NeRF). However, most existing methods focus on person-specific training, and their training typically requires multi-view videos. This paper deals with a new and challenging task: rendering novel views and novel poses for a person unseen in training, using only multi-view still images as input, without videos. For this task, we propose a simple yet surprisingly effective method to train a generalizable NeRF with multi-view images as conditional input. The key ingredient is a dedicated representation combining a canonical NeRF and a volume deformation scheme. Using a canonical space enables our method to learn shared properties of humans and to generalize easily to different people. Volume deformation is used to connect the canonical space with the input and target images and to query image features for radiance and density prediction. We leverage a parametric 3D human model fitted to the input images to derive the deformation, which works quite well in practice when combined with our canonical NeRF. Experiments on both real and synthetic data, covering the novel view synthesis and pose animation tasks, collectively demonstrate the efficacy of our method.
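The abstract does not say how the deformation is computed from the fitted body model, so the following is only a minimal sketch of one simple possibility, not the authors' method. It assumes each body-model vertex carries a rigid transform from canonical to posed space, and that a query point in posed space borrows the transform of its nearest posed vertex. The names `nearest_vertex` and `to_canonical` are hypothetical and introduced purely for illustration.

```python
import math

def nearest_vertex(point, vertices):
    """Return the index of the vertex closest to point (Euclidean distance)."""
    return min(range(len(vertices)), key=lambda i: math.dist(point, vertices[i]))

def to_canonical(point, posed_vertices, rotations, translations):
    """Map a posed-space 3D point into canonical space.

    Assumption (not from the abstract): for every vertex,
    posed = R @ canonical + t, so the inverse is canonical = R^T @ (posed - t).
    The query point reuses the transform of its nearest posed vertex.
    """
    i = nearest_vertex(point, posed_vertices)
    R, t = rotations[i], translations[i]
    d = [point[k] - t[k] for k in range(3)]
    # Multiply by the transpose of R: entry j sums column j of R against d.
    return [sum(R[k][j] * d[k] for k in range(3)) for j in range(3)]

# Example: a vertex rotated 90 degrees about z and lifted by 2 along z.
rot_z90 = [[0.0, -1.0, 0.0], [1.0, 0.0, 0.0], [0.0, 0.0, 1.0]]
canonical_point = to_canonical(
    [0.0, 1.1, 2.0],        # query point near the posed vertex
    [[0.0, 1.0, 2.0]],      # posed vertex positions
    [rot_z90],              # per-vertex rotations
    [[0.0, 0.0, 2.0]],      # per-vertex translations
)
```

In a pipeline of the kind the abstract describes, the canonical-space point would then be passed to the canonical NeRF, while the posed-space point would be projected into the input images to fetch features. Both of those steps are omitted here.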