体素
点云
计算机科学
人工智能
激光雷达
计算机视觉
代表(政治)
模式识别(心理学)
目标检测
特征(语言学)
对象(语法)
遥感
地理
哲学
法学
政治
语言学
政治学
作者
Deze Zhao,Shengjie Zhao,Shuang Liang
标识
DOI:10.1007/978-981-99-7025-4_17
摘要
Voxel-based 3D object detection methods have gained more popularity in autonomous driving. However, due to the sparse nature of LiDAR point clouds, voxels from conventional cubic partition lead to incomplete representation of objects in farther range. This poses significant challenges to 3D object perception. In this paper, we propose a novel 3D object detector dubbed SVFNeXt, a Sparse Voxel Fusion Network that performs cross-representation (X) feature learning. It is because cylindrical voxel representation considers the rotational or radial scanning of LiDAR that we can better explore the inherent 3D geometric structure of point clouds. To further enchance cubic voxel features, we innovatively integrates the features of cylindrical voxels into cubic voxels, incorporating both local and global features. We particularly attend to informative voxels by two additional losses, striking a good speed-accuracy tradeoff. Extensive experiments on the WOD and KITTI datasets demonstrate consistent improvements over baselines. Our SVFNeXt achieves competitive results compared to state-of-the-art methods, especially for small objects(e.g., cyclist, pedestrian).
科研通智能强力驱动
Strongly Powered by AbleSci AI