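Computer science
Artificial intelligence
Face (sociological concept)
Computer vision
Monocular
Polygon mesh
Pixel
Vertex (graph theory)
Iterative reconstruction
3D reconstruction
Pattern recognition (psychology)
Computer graphics (images)
Graph
Social science
Theoretical computer science
Sociology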
Authors
Yong Li, Qiang Hao, Jianguo Hu, Xinmiao Pan, Zechao Li, Zhen Cui
Identifier
DOI:10.1109/tmm.2022.3212282
Abstract
3D face reconstruction from a single image is an important task in many multimedia applications. A key challenge in 3D face shape reconstruction is building the correct dense correspondence between the monocular input face and the deformable mesh. Most existing methods rely on shape labels fitted by traditional methods or on strong priors such as multi-view geometric consistency. In contrast, we propose a 3D Modulated Morphable Model (3D3M) that learns the dense shape correspondence from monocular images in a self-supervised manner. Specifically, given a batch of input faces, 3D3M encodes their 3DMM attributes (shape, texture, lighting, etc.) and then randomly shuffles these attributes across the batch to generate attribute-changed faces. The attribute-changed faces can be encoded and rendered back in a cycle-consistent manner, which lets us exploit self-supervised consistency in both the dense mesh vertices and the reconstructed pixels. This dense shape and pixel correspondence allows us to apply a series of self-supervised constraints that fit the 3D face model accurately and learn per-vertex correctives end-to-end. 3D3M produces high-quality 3D face reconstructions from monocular images. Both quantitative and qualitative experimental results verify the superiority of 3D3M over prior art on 3D face reconstruction and face alignment.
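The attribute shuffling and vertex-level consistency described in the abstract can be illustrated with a small NumPy sketch. This is not the authors' implementation: the abstract does not specify the encoder, renderer, or loss functions, so the linear morphable model, the attribute dimensions, and the names `shuffle_attributes`, `linear_3dmm_vertices`, and `vertex_consistency_loss` are all assumptions made for illustration. The re-encoding step, which in a real pipeline would be a learned network applied to the rendered attribute-changed face, is replaced here by an identity stand-in.

```python
import numpy as np


def shuffle_attributes(attrs, rng):
    """Permute each attribute independently across the batch dimension.

    Returns the shuffled attributes and the permutation used for each one,
    so that every attribute-changed face can be traced back to its sources.
    """
    shuffled, perms = {}, {}
    for name, values in attrs.items():
        perm = rng.permutation(values.shape[0])
        shuffled[name] = values[perm]
        perms[name] = perm
    return shuffled, perms


def linear_3dmm_vertices(mean_shape, shape_basis, shape_coeffs):
    """Toy linear morphable model: vertices = mean + basis @ coefficients.

    mean_shape: (3N,), shape_basis: (3N, K), shape_coeffs: (B, K).
    Returns flattened vertex coordinates of shape (B, 3N).
    """
    return mean_shape[None, :] + shape_coeffs @ shape_basis.T


def vertex_consistency_loss(verts_a, verts_b):
    """Mean squared distance between two sets of dense mesh vertices."""
    return float(np.mean((verts_a - verts_b) ** 2))


rng = np.random.default_rng(0)
B, K, N = 4, 5, 10  # batch size, coefficients per attribute, mesh vertices

# Hypothetical encoded attributes for a batch of faces (random stand-ins).
attrs = {
    "shape": rng.normal(size=(B, K)),
    "texture": rng.normal(size=(B, K)),
    "lighting": rng.normal(size=(B, 27)),
}
shuffled, perms = shuffle_attributes(attrs, rng)

# Hypothetical morphable-model parameters (random stand-ins).
mean_shape = rng.normal(size=3 * N)
shape_basis = rng.normal(size=(3 * N, K))

verts_original = linear_3dmm_vertices(mean_shape, shape_basis, attrs["shape"])

# Identity stand-in for "render the attribute-changed face and encode it again".
reencoded_shape = shuffled["shape"]
verts_cycle = linear_3dmm_vertices(mean_shape, shape_basis, reencoded_shape)

# A face that borrowed its shape from sample perm[i] should reproduce
# that sample's dense vertices after the cycle.
loss = vertex_consistency_loss(verts_cycle, verts_original[perms["shape"]])
print(loss)
```

Because the re-encoding step is an identity here, the loss is zero up to floating-point error. With a learned encoder the same quantity would be nonzero and could serve as one of the self-supervised training signals.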