人工智能
RGB颜色模型
计算机科学
计算机视觉
水准点(测量)
跳跃式监视
单眼
卷积神经网络
目标检测
深度学习
图像(数学)
融合
深度图
图像融合
模式识别(心理学)
哲学
语言学
地理
大地测量学
作者
Fahimeh Farahnakian,Jukka Heikkonen
标识
DOI:10.1109/icspcs50536.2020.9310031
摘要
We present an early fusion framework for robust object detection in autonomous vehicles. This framework firstly employs Monodepth as a self-supervised learning method to automatically infer a dense depth image from a single color input image. Then, the RGB image and its corresponding depth image are processed by a deep Convolutional Neural Networks (CNNs) to predict multiple 2D bounding boxes. We conduct experiments on the challenging KITTI benchmark dataset. The experimental results show that the features learnt from our fusion framework, when fused with the features learnt from depth-only and RGB-only architectures, outperform the state of the art on RGB-depth category recognition. We also investigated on performance of our fusion framework when it utilizes various sources (such as monocular and stereo imagery or both imageries) for generating the depth image.
科研通智能强力驱动
Strongly Powered by AbleSci AI