鉴别器
计算机科学
人工智能
单眼
特征(语言学)
模式识别(心理学)
不变(物理)
注释
估计员
特征学习
计算机视觉
数学
统计
数学物理
电信
语言学
哲学
探测器
作者
Jiehua Zhang,Liang Li,Chenggang Yan,Zhan Wang,Changliang Xu,Jiyong Zhang,Chuqiao Chen
摘要
Predicting depth maps from monocular images has made an impressive performance in the past years. However, most depth estimation methods are trained with paired image-depth map data or multi-view images (e.g., stereo pair and monocular sequence), which suffer from expensive annotation costs and poor transferability. Although unsupervised domain adaptation methods are introduced to mitigate the reliance on annotated data, rare works focus on the unsupervised cross-scenario indoor monocular depth estimation. In this paper, we propose to study the generalization of depth estimation models across different indoor scenarios in an adversarial-based domain adaptation paradigm. Concretely, a domain discriminator is designed for discriminating the representation from source and target domains, while the feature extractor aims to confuse the domain discriminator by capturing domain-invariant features. Further, we reconstruct depth maps from latent representations with the supervision of labeled source data. As a result, the feature extractor learned features possess the merit of both domain-invariant and low source risk, and the depth estimator can deal with the domain shift between source and target domains. We conduct the cross-scenario and cross-dataset experiments on the ScanNet and NYU-Depth-v2 datasets to verify the effectiveness of our method and achieve impressive performance.
科研通智能强力驱动
Strongly Powered by AbleSci AI