亚像素渲染
人工智能
计算机科学
卷积神经网络
模式识别(心理学)
匹配(统计)
滤波器(信号处理)
一致性(知识库)
集合(抽象数据类型)
相似性(几何)
计算机视觉
人工神经网络
数据集
图像(数学)
像素
数学
统计
程序设计语言
作者
Jure Žbontar,Yann LeCun
出处
期刊:Cornell University - arXiv
日期:2015-10-20
被引量:908
摘要
We present a method for extracting depth information from a rectified image pair. Our approach focuses on the first stage of many stereo algorithms: the matching cost computation. We approach the problem by learning a similarity measure on small image patches using a convolutional neural network. Training is carried out in a supervised manner by constructing a binary classification data set with examples of similar and dissimilar pairs of patches. We examine two network architectures for this task: one tuned for speed, the other for accuracy. The output of the convolutional neural network is used to initialize the stereo matching cost. A series of post-processing steps follow: cross-based cost aggregation, semiglobal matching, a left-right consistency check, subpixel enhancement, a median filter, and a bilateral filter. We evaluate our method on the KITTI 2012, KITTI 2015, and Middlebury stereo data sets and show that it outperforms other approaches on all three data sets.
科研通智能强力驱动
Strongly Powered by AbleSci AI