深度图
计算机科学
偏移量(计算机科学)
计算机视觉
人工智能
完备性(序理论)
像素
点云
推论
立体视觉
图像(数学)
数学
数学分析
程序设计语言
作者
Kui Lin,Lei Li,Jianjun Zhang,Xing Zheng,Suping Wu
标识
DOI:10.1109/icme51207.2021.9428281
摘要
Multi-view stereo based on deep learning is mostly dedicated to improving the accuracy of point clouds, whlile complex scenes, occlusion, and other factors limit their reconstruction completeness, especially in the area with drastic changes in depth direction. In this paper, we propose a multi-view stereo network based on depth edge flow (DEF-MVSNet), using the reference image as a guide to dynamically infer the edge coordinates to improve reconstruction completeness. First, we ignore the boundaries in the depth prediction stage to generate better initial depth inference results. Then, we use a EdgeDetect module to extract the obviously features of the reference image and predict the pixel offset of the depth map. Finally, the EdgeFlow module modifies the initial depth map coordinates according to the offset and uses multiple iterations to dynamically update the depth map. The experimental results prove that our method has a great improvement in the completeness of reconstruction compared with MVSNet and R-MVSNet without increasing the memory and time overhead. Code and models are publicly available at https://github.com/linkuizzZ/EF-MVSNet.
科研通智能强力驱动
Strongly Powered by AbleSci AI