计算机科学
人工智能
卷积神经网络
单眼
水准点(测量)
GSM演进的增强数据速率
软件部署
机器学习
深度学习
边缘设备
机器人学
变压器
蒸馏
机器人
工程类
云计算
化学
大地测量学
有机化学
电压
电气工程
地理
操作系统
作者
Gao Wei,D. Rajeswara Rao,Yang Yang,Jie Chen
出处
期刊:IEEE robotics and automation letters
日期:2023-12-01
卷期号:8 (12): 8470-8477
标识
DOI:10.1109/lra.2023.3330054
摘要
Self-supervised monocular depth estimation (MDE) has great potential for deployment in a wide range of applications, including virtual reality, autonomous driving, and robotics. Nevertheless, most previous studies focused on complex architectures to pursue better performance in MDE. In this letter, we aim to develop a lightweight yet highly effective self-supervised MDE model that can deliver competitive performance in edge devices. We introduce a novel MobileViT-based depth (MViTDepth) network that can effectively capture both local features and global information by leveraging the strengths of convolutional neural networks (CNNs) and a vision transformer (ViT). To further compress the proposed MViTDepth model, we employ knowledge distillation, which leads to improved depth estimation performance. Specifically, the self-supervised MDE MonoViT is used as a teacher model to construct the knowledge distillation loss for optimizing a student model. Experimental results on benchmark datasets demonstrate that the proposed MViTDepth significantly outperforms Monodepth2 in terms of parameters and accuracy, thereby indicating its superiority in application to edge devices.
科研通智能强力驱动
Strongly Powered by AbleSci AI