计算机科学
人工智能
监督学习
管道(软件)
帧(网络)
机器学习
单眼
基本事实
集合(抽象数据类型)
人工神经网络
任务(项目管理)
深度学习
计算机视觉
电信
管理
经济
程序设计语言
作者
Ilya Makarov,Maria Bakhanova,Sergey Nikolenko,Olga Gerasimova
出处
期刊:PeerJ
[PeerJ]
日期:2022-01-31
卷期号:8: e865-e865
被引量:26
摘要
Depth estimation has been an essential task for many computer vision applications, especially in autonomous driving, where safety is paramount. Depth can be estimated not only with traditional supervised learning but also via a self-supervised approach that relies on camera motion and does not require ground truth depth maps. Recently, major improvements have been introduced to make self-supervised depth prediction more precise. However, most existing approaches still focus on single-frame depth estimation, even in the self-supervised setting. Since most methods can operate with frame sequences, we believe that the quality of current models can be significantly improved with the help of information about previous frames. In this work, we study different ways of integrating recurrent blocks and attention mechanisms into a common self-supervised depth estimation pipeline. We propose a set of modifications that utilize temporal information from previous frames and provide new neural network architectures for monocular depth estimation in a self-supervised manner. Our experiments on the KITTI dataset show that proposed modifications can be an effective tool for exploiting temporal information in a depth prediction pipeline.
科研通智能强力驱动
Strongly Powered by AbleSci AI