Bing Li,Jie Chen,Xinde Li,Rui Xu,Qian Li,Yice Cao,Jun Wu,Lei Qu,Yingsong Li,Paulo S. R. Diniz
出处
期刊:IEEE Transactions on Intelligent Transportation Systems [Institute of Electrical and Electronics Engineers] 日期:2024-03-29卷期号:25 (9): 12034-12048被引量:2
标识
DOI:10.1109/tits.2024.3373227
摘要
In this work, we propose a voxel-based single-stage fine-grained and efficient point cloud 3D object detection algorithm to address the inadequate granularity in point cloud feature extraction tasks and the imbalance between efficiency and accuracy in single-stage point cloud 3D object detection scenarios. We develop a lightweight multibranch cross-sparse convolution network (LMCCN) that is designed to preserve the feature granularity of the original point cloud while achieving enhanced extraction efficiency. Additionally, we introduce a compact fine-grained self-attention augmented bird's eye view (BEV) feature extraction module (CFSAM). This module aims to further refine BEV features, enabling the acquisition of both locally and globally enhanced features and thereby augmentingthe perceptual capabilities of the constructed model. Without bells and whistles, the proposed method attains excellent performance on many autonomous driving benchmarks, with detection accuracies of up to 81.67% on KITTI, 72.74% on ONCE, and 84.00% on nuScenes. Moreover, it reaches a peak detection speed of 46.08 FPS, effectively balancing accuracy with speed.