计算机科学
单眼
人工智能
计算机视觉
编码器
一般化
航程(航空)
变压器
深度图
图像(数学)
数学
工程类
数学分析
电压
航空航天工程
操作系统
电气工程
作者
Jianghai Shuai,Ming Li,Yongkang Feng,Yang Li,Sidan Du
标识
DOI:10.1109/uemcon59035.2023.10316039
摘要
In the field of computer vision, monocular depth estimation has garnered significant attention as a research direction. However, current depth estimation methods often overlook the impact of depth range variations in indoor and outdoor scenes, consequently limiting the model’s generalization ability. To achieve high-precision depth estimation across different depth ranges, we propose a new method. We employ the pretrained model Dinov2 as encoder, combined with decoder based on CNN architecture, to enhance the network’s capacity for extracting global information from indoor and outdoor scenes. Also, we design a mapping module to transform diverse depth ranges into a unified 0-1 range, which can effectively adapt to indoor and outdoor scenes. We validate our method on the DIODE dataset, which comprises mixed indoor and outdoor scenes. Experimental results demonstrate that our method achieves higher depth estimation accuracy and stronger generalization performance when dealing with scenes of diverse depth ranges.
科研通智能强力驱动
Strongly Powered by AbleSci AI