人工智能
计算机科学
匹配(统计)
增采样
人工神经网络
算法
理论计算机科学
计算机视觉
图像(数学)
数学
统计
作者
Vladimir Tankovich,Christian Häne,Yinda Zhang,Adarsh Kowdle,Sean Fanello,Sofien Bouaziz
标识
DOI:10.1109/cvpr46437.2021.01413
摘要
This paper presents HITNet, a novel neural network architecture for real-time stereo matching. Contrary to many recent neural network approaches that operate on a full cost volume and rely on 3D convolutions, our approach does not explicitly build a volume and instead relies on a fast multi-resolution initialization step, differentiable 2D geometric propagation and warping mechanisms to infer disparity hypotheses. To achieve a high level of accuracy, our network not only geometrically reasons about disparities but also infers slanted plane hypotheses allowing to more accurately perform geometric warping and upsampling operations. Our architecture is inherently multi-resolution allowing the propagation of information across different levels. Multiple experiments prove the effectiveness of the proposed approach at a fraction of the computation required by state-of-the-art methods. At the time of writing, HITNet ranks 1 st -3 rd on all the metrics published on the ETH3D website for two view stereo, ranks 1 st on most of the metrics amongst all the end-to-end learning approaches on Middlebury-v3, ranks 1 st on the popular KITTI 2012 and 2015 benchmarks among the published methods faster than 100 ms.
科研通智能强力驱动
Strongly Powered by AbleSci AI