计算机科学
水准点(测量)
任务(项目管理)
对象(语法)
传感器融合
目标检测
比例(比率)
实时计算
计算机视觉
人工智能
工程类
模式识别(心理学)
物理
大地测量学
系统工程
量子力学
地理
作者
Haibao Yu,Yizhen Luo,Mao Shu,Yiyi Huo,Zebang Yang,Yifeng Shi,Zhenglong Guo,Hanyu Li,Xing Hu,Jirui Yuan,Zaiqing Nie
标识
DOI:10.1109/cvpr52688.2022.02067
摘要
Autonomous driving faces great safety challenges for a lack of global perspective and the limitation of long-range perception capabilities. It has been widely agreed that vehicle-infrastructure cooperation is required to achieve Level 5 autonomy. However, there is still NO dataset from real scenarios available for computer vision researchers to work on vehicle-infrastructure cooperation-related problems. To accelerate computer vision research and innovation for Vehicle-Infrastructure Cooperative Autonomous Driving (VICAD), we release DAIR-V2X Dataset, which is the first large-scale, multi-modality, multi-view dataset from real scenarios for VICAD. DAIR-V2X comprises 71254 LiDAR frames and 71254 Camera frames, and all frames are captured from real scenes with 3D annotations. The Vehicle-Infrastructure Cooperative 3D Object Detection problem (VIC3D) is introduced, formulating the problem of collaboratively locating and identifying 3D objects using sensory inputs from both vehicle and infrastructure. In addition to solving traditional 3D object detection problems, the solution of VIC3D needs to consider the temporal asynchrony problem between vehicle and infrastructure sensors and the data transmission cost between them. Furthermore, we propose Time Compensation Late Fusion (TCLF), a late fusion framework for the VIC3D task as a benchmark based on DAIR-V2X. Find data, code, and more up-to-date information at https://thudair.baai.ac.cn/index and https://github.com/AIR-Thu/dair-V2x.
科研通智能强力驱动
Strongly Powered by AbleSci AI