计算机科学
对偶(语法数字)
计算机视觉
人工智能
视频跟踪
跟踪(教育)
对象(语法)
运动(物理)
心理学
教育学
文学类
艺术
作者
NI Zhi-xiang,Chao Zhai,Yujun Li,Yang Yang
出处
期刊:IEEE Access
[Institute of Electrical and Electronics Engineers]
日期:2024-01-01
卷期号:: 1-1
标识
DOI:10.1109/access.2024.3362673
摘要
For multi-object tracking (MOT), jointly learning the detector and embedding model (JDE) is one of the mainstream solutions. However, an inherent problem in this architecture arises as the tasks of target detection and appearance feature extraction compete with each other. FairMOT, as a representative method, attempts to address this issue by employing two homogeneous branches, but it overlooks the essential difference between these two tasks. Upon the original network architecture, we propose an adaptive dual decoder structure. Our objective is to separately learn more focused features for the target detection and the appearance feature extraction. Furthermore, we introduce a noise-adaptive Kalman filter based on the width estimation. In the motion information matching stage, we enhance the affinity matrix of motion information by employing an expanded-width strategy, combined with a more accurate overlap measure. We verify the effectiveness of our proposed approach through extensive experiments using the MOT17 dataset.
科研通智能强力驱动
Strongly Powered by AbleSci AI