分割
计算机科学
人工智能
磁道(磁盘驱动器)
联想(心理学)
计算机视觉
跟踪(教育)
目标检测
运动(物理)
编码(集合论)
深度学习
变压器
数据关联
运动检测
视频跟踪
对象(语法)
工程类
心理学
教育学
哲学
集合(抽象数据类型)
认识论
电压
概率逻辑
电气工程
程序设计语言
操作系统
作者
Kaer Huang,Bingchuan Sun,Feng Chen,Tao Zhang,Jun Xie,Jian Li,Christopher Walter Twombly,Zhepeng Wang
出处
期刊:Cornell University - arXiv
日期:2023-01-01
被引量:1
标识
DOI:10.48550/arxiv.2308.01622
摘要
In recent years, dominant Multi-object tracking (MOT) and segmentation (MOTS) methods mainly follow the tracking-by-detection paradigm. Transformer-based end-to-end (E2E) solutions bring some ideas to MOT and MOTS, but they cannot achieve a new state-of-the-art (SOTA) performance in major MOT and MOTS benchmarks. Detection and association are two main modules of the tracking-by-detection paradigm. Association techniques mainly depend on the combination of motion and appearance information. As deep learning has been recently developed, the performance of the detection and appearance model is rapidly improved. These trends made us consider whether we can achieve SOTA based on only high-performance detection and appearance model. Our paper mainly focuses on exploring this direction based on CBNetV2 with Swin-B as a detection model and MoCo-v2 as a self-supervised appearance model. Motion information and IoU mapping were removed during the association. Our method wins 1st place on the MOTS track and wins 2nd on the MOT track in the CVPR2023 WAD workshop. We hope our simple and effective method can give some insights to the MOT and MOTS research community. Source code will be released under this git repository
科研通智能强力驱动
Strongly Powered by AbleSci AI