Computer science
Epipolar geometry
Viewpoint
Artificial intelligence
Computer vision
View synthesis
Virtual reality
Image (mathematics)
Rendering (computer graphics)
Art
Visual arts
Authors
Hung-Yu Tseng, Qinbo Li, Changil Kim, Suhib Alsisan, Jia-Bin Huang, Johannes Kopf
Identifier
DOI: 10.1109/cvpr52729.2023.01609
Abstract
Novel view synthesis from a single image has been a cornerstone problem for many Virtual Reality applications that provide immersive experiences. However, most existing techniques can only synthesize novel views within a limited range of camera motion or fail to generate consistent and high-quality novel views under significant camera movement. In this work, we propose a pose-guided diffusion model to generate a consistent long-term video of novel views from a single image. We design an attention layer that uses epipolar lines as constraints to facilitate the association between different viewpoints. Experimental results on synthetic and real-world datasets demonstrate the effectiveness of the proposed diffusion model against state-of-the-art transformer-based and GAN-based approaches. More qualitative results are available at https://poseguided-diffusion.github.io/.
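The epipolar-line attention constraint described in the abstract can be illustrated with standard two-view geometry. The Python/NumPy sketch below is an illustration only, not the authors' implementation: assuming pinhole intrinsics K_src and K_tgt and a relative pose (R, t) from the source camera to the target camera, it forms the fundamental matrix, computes per-pixel epipolar lines, and masks cross-view attention so that each target pixel attends only to source pixels whose epipolar lines pass nearby. All function names and the pixel threshold are hypothetical.

```python
# Illustrative sketch only (NOT the paper's implementation): epipolar-
# constrained cross-view attention built from standard two-view geometry.
import numpy as np

def skew(t):
    """Cross-product matrix [t]_x such that skew(t) @ v == np.cross(t, v)."""
    return np.array([[0.0, -t[2], t[1]],
                     [t[2], 0.0, -t[0]],
                     [-t[1], t[0], 0.0]])

def fundamental_matrix(K_src, K_tgt, R, t):
    """F such that a homogeneous source pixel x_s has epipolar line
    l = F @ x_s in the target image; (R, t) maps source-camera coordinates
    to target-camera coordinates (X_tgt = R @ X_src + t)."""
    E = skew(t) @ R  # essential matrix
    return np.linalg.inv(K_tgt).T @ E @ np.linalg.inv(K_src)

def epipolar_attention_mask(h, w, F, threshold=2.0):
    """Boolean (h*w, h*w) mask: entry (i, j) is True when target pixel i
    lies within `threshold` pixels of the epipolar line of source pixel j."""
    ys, xs = np.mgrid[0:h, 0:w]
    pix = np.stack([xs.ravel(), ys.ravel(), np.ones(h * w)], axis=0)  # 3 x N
    lines = F @ pix                                   # epipolar lines, 3 x N
    num = np.abs(pix.T @ lines)                       # |p_i . l_j|, N x N
    den = np.linalg.norm(lines[:2], axis=0, keepdims=True)  # line normal length
    return num / np.maximum(den, 1e-8) < threshold

def masked_attention(q, k, v, mask):
    """Single-head cross-attention where target queries may only attend to
    source keys allowed by the epipolar mask (True = keep)."""
    logits = q @ k.T / np.sqrt(q.shape[-1])
    logits = np.where(mask, logits, -1e9)             # suppress off-line pairs
    attn = np.exp(logits - logits.max(axis=-1, keepdims=True))
    attn /= attn.sum(axis=-1, keepdims=True)
    return attn @ v
```

In the paper's setting such a constraint would act inside the cross-frame attention layers of the diffusion backbone; the sketch above only demonstrates the geometric masking itself.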