计算机科学
三维重建
人工智能
特征(语言学)
计算机视觉
迭代重建
曲面重建
体积热力学
元数据
曲面(拓扑)
数学
几何学
操作系统
物理
哲学
量子力学
语言学
作者
Mohamed Sayed,John Gibson,Jamie Carlin Watson,Victor Adrian Prisacariu,Michael Firman,Clément Godard
出处
期刊:Cornell University - arXiv
日期:2022-08-31
标识
DOI:10.48550/arxiv.2208.14743
摘要
Traditionally, 3D indoor scene reconstruction from posed images happens in two phases: per-image depth estimation, followed by depth merging and surface reconstruction. Recently, a family of methods have emerged that perform reconstruction directly in final 3D volumetric feature space. While these methods have shown impressive reconstruction results, they rely on expensive 3D convolutional layers, limiting their application in resource-constrained environments. In this work, we instead go back to the traditional route, and show how focusing on high quality multi-view depth prediction leads to highly accurate 3D reconstructions using simple off-the-shelf depth fusion. We propose a simple state-of-the-art multi-view depth estimator with two main contributions: 1) a carefully-designed 2D CNN which utilizes strong image priors alongside a plane-sweep feature volume and geometric losses, combined with 2) the integration of keyframe and geometric metadata into the cost volume which allows informed depth plane scoring. Our method achieves a significant lead over the current state-of-the-art for depth estimation and close or better for 3D reconstruction on ScanNet and 7-Scenes, yet still allows for online real-time low-memory reconstruction. Code, models and results are available at https://nianticlabs.github.io/simplerecon
科研通智能强力驱动
Strongly Powered by AbleSci AI