Computer Science
Modality (human-computer interaction)
Artificial Intelligence
Benchmark (surveying)
Reliability (semiconductor)
Video Tracking
Computer Vision
Point Cloud
Feature (linguistics)
Encoding
Object Detection
Sensor Fusion
Process (computing)
Object (grammar)
Pattern Recognition (psychology)
Authors
Wen-Wei Zhang, Hui Zhou, Shuyang Sun, Zhe Wang, Jianping Shi, Chen Change Loy
Source
Venue: International Conference on Computer Vision (ICCV)
Date: 2019-10-01
Citations: 103
Identifier
DOI: 10.1109/iccv.2019.00245
Abstract
Multi-sensor perception is crucial to ensuring reliability and accuracy in autonomous driving systems, while multi-object tracking (MOT) improves both by tracing the sequential movement of dynamic objects. Most current approaches to multi-sensor multi-object tracking either lack reliability, by depending tightly on a single input source (e.g., the center camera), or are not accurate enough, by fusing the results from multiple sensors in post-processing without fully exploiting the inherent information. In this study, we design a generic sensor-agnostic multi-modality MOT framework (mmMOT), in which each modality (i.e., sensor) is capable of performing its role independently to preserve reliability, while accuracy is further improved through a novel multi-modality fusion module. Our mmMOT can be trained in an end-to-end manner, enabling joint optimization of the base feature extractor of each modality and a cross-modality adjacency estimator. Our mmMOT also makes the first attempt to encode a deep representation of the point cloud in the data association process of MOT. We conduct extensive experiments to evaluate the effectiveness of the proposed framework on the challenging KITTI benchmark and report state-of-the-art performance. Code and models are available at https://github.com/ZwwWayne/mmMOT.
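As a rough sketch of the architecture the abstract describes, the PyTorch snippet below pairs a per-detection fusion module with a pairwise adjacency estimator for cross-frame data association. All module and variable names here are hypothetical and the per-modality feature extractors are elided; this is only an illustration of the idea under those assumptions, not the authors' implementation, which is available at the GitHub link above.

```python
# Hypothetical sketch of the mmMOT idea; the authors' actual code is at
# https://github.com/ZwwWayne/mmMOT.
import torch
import torch.nn as nn

class FusionModule(nn.Module):
    """Fuses per-modality detection features. Because each branch also works
    alone, the tracker can keep running if one sensor drops out."""
    def __init__(self, dim):
        super().__init__()
        self.proj = nn.Linear(2 * dim, dim)

    def forward(self, img_feat, pc_feat):
        # Concatenate image and point-cloud features, then project back to dim.
        return torch.relu(self.proj(torch.cat([img_feat, pc_feat], dim=-1)))

class AdjacencyEstimator(nn.Module):
    """Scores pairwise association between detections in frames t and t+1."""
    def __init__(self, dim):
        super().__init__()
        self.score = nn.Sequential(
            nn.Linear(2 * dim, dim), nn.ReLU(), nn.Linear(dim, 1)
        )

    def forward(self, feats_t, feats_t1):
        n, m = feats_t.size(0), feats_t1.size(0)
        # Build all n x m feature pairs and score each one.
        pairs = torch.cat([
            feats_t.unsqueeze(1).expand(n, m, -1),
            feats_t1.unsqueeze(0).expand(n, m, -1),
        ], dim=-1)
        return self.score(pairs).squeeze(-1)  # (n, m) adjacency logits

if __name__ == "__main__":
    fuse, adj = FusionModule(64), AdjacencyEstimator(64)
    f_t = fuse(torch.randn(3, 64), torch.randn(3, 64))   # 3 detections at t
    f_t1 = fuse(torch.randn(4, 64), torch.randn(4, 64))  # 4 detections at t+1
    print(adj(f_t, f_t1).shape)  # torch.Size([3, 4])
```

In the full framework, these adjacency logits would feed a matching step (e.g., greedy or linear-program assignment) that links detections into tracks, and the whole pipeline is trained end-to-end so the feature extractors learn representations that are useful for association.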