人工智能
计算机视觉
目标检测
单眼
计算机科学
单目视觉
分割
像面
偏移量(计算机科学)
方向(向量空间)
行人检测
Viola–Jones对象检测框架
对象(语法)
姿势
特征提取
工程类
数学
图像(数学)
人脸检测
几何学
程序设计语言
行人
面部识别系统
运输工程
作者
Muhamad Amirul Haq,Shanq-Jang Ruan,Mei-En Shao,Qazi Mazhar ul Haq,Pei-Jung Liang,De-Qin Gao
出处
期刊:IEEE Transactions on Intelligent Transportation Systems
[Institute of Electrical and Electronics Engineers]
日期:2022-11-01
卷期号:23 (11): 21630-21640
被引量:4
标识
DOI:10.1109/tits.2022.3175198
摘要
On-road object detection is a critical component in an autonomous driving system. The safety of the vehicle can only be as good as the reliability of the on-road object detection system. Thus, developing a fast and robust object detection algorithm has been the primary goal of many automotive industries and institutes. In recent years, multi-purpose vision-based driver assistance systems have gained popularity with the emergence of a deep neural network. A monocular camera has been developed to locate an object in the image plane and estimate the distance of the said object in the real world or the vehicle plane. In this work, we present a monocular 3D object detection method that utilizes the discrete depth and orientation representation. Our proposed method strives to predict object locations on 3D space utilizing keypoint detection on the object’s center point. To improve the point detection, we employ center regression on the objects segmentation mask, reducing the detection offset significantly. The simplicity of our proposed network architecture and its one-stage approach allows our algorithm to achieve competitive speed compared with prior methods. Our proposed method is able to achieve 26.93% detection score on the Cityscapes 3D object detection dataset, outperforming the preceding monocular method by a margin of 2.8 points.
科研通智能强力驱动
Strongly Powered by AbleSci AI