计算机科学
行人检测
点云
人工智能
卷积神经网络
任务(项目管理)
特征(语言学)
频道(广播)
目标检测
联营
特征提取
计算机视觉
行人
模式识别(心理学)
工程类
语言学
哲学
系统工程
计算机网络
运输工程
作者
Duy Tho Le,Hao Shi,Hamid Rezatofighi,Jianfei Cai
出处
期刊:IEEE robotics and automation letters
日期:2023-02-01
卷期号:8 (2): 1159-1166
被引量:5
标识
DOI:10.1109/lra.2022.3233234
摘要
Efficiently and accurately detecting people from 3D point cloud data is of great importance in many robotic and autonomous driving applications. This fundamental perception task is still very challenging due to (i) significant deformations of human body pose and gesture over time and (ii) point cloud sparsity and scarcity for pedestrian objects. Recent efficient 3D object detection approaches rely on pillar features. However, these pillar features do not carry sufficient expressive representations to deal with all the aforementioned challenges in detecting people. To address this shortcoming, we first introduce a stackable Pillar Aware Attention (PAA) module to enhance pillar feature extraction while suppressing noises in point clouds. By integrating multi-point-channel-pooling, point-wise, channel-wise, and task-aware attention into a simple module, representation capabilities of pillar features are boosted while only requiring little additional computational resources. We also present Mini-BiFPN, a small yet effective feature network that creates bidirectional information flow and multi-level cross-scale feature fusion to better integrate multi-resolution features. Our proposed framework, namely PiFeNet, has been evaluated on three popular large-scale datasets for 3D pedestrian Detection, i.e. KITTI, JRDB, and nuScenes. It achieves state-of-the-art performance on KITTI Bird-eye-view (BEV) as well as JRDB, and competitive performance on nuScenes. Our approach is a real-time detector with 26 frame-per-second (FPS).
科研通智能强力驱动
Strongly Powered by AbleSci AI