计算机科学
人工智能
点云
计算机视觉
体素
分割
模式识别(心理学)
目标检测
保险丝(电气)
特征(语言学)
特征提取
残余物
图形
语言学
哲学
算法
理论计算机科学
电气工程
工程类
作者
Tianxiang Chen,Chao Han
标识
DOI:10.1117/1.jei.32.5.053039
摘要
Three-dimensional (3D) object detection is crucial for accurate recognition of autonomous driving roads, and the distribution of point clouds in 3D scenes becomes sparse with increasing distance, thus seriously affecting the sensor’s perception precision. To address this problem, we propose a two-stage 3D object detection network based on point and voxel feature fusion. In the first stage, a spatial semantic feature fusion module is designed to effectively fuse low-level spatial features and high-level semantic features to generate high-quality proposals. Then, an attention mechanism-based residual module is constructed to expand the receptive field and adaptively aggregate the voxel features in the 3D scene. At the same time, the sampled key points and voxel features are fused to extract the key information in the 3D scene. In the second stage, the graph network pooling module is introduced to construct local graphs on 3D proposals using key point features as nodes to estimate the confidence and location of objects more accurately. Experimental results on the KITTI dataset show that the detection precision is improved significantly in easy, moderate, and hard tasks.
科研通智能强力驱动
Strongly Powered by AbleSci AI