稳健性(进化)
情态动词
人工智能
计算机科学
杠杆(统计)
BitTorrent跟踪器
边距(机器学习)
公制(单位)
基本事实
模式识别(心理学)
特征提取
机器学习
数据挖掘
算法
眼动
工程类
化学
高分子化学
基因
生物化学
运营管理
作者
Zhengzheng Tu,Chun C. Lin,Wei Zhao,Chenglong Li,Jin Tang
出处
期刊:IEEE transactions on image processing
[Institute of Electrical and Electronics Engineers]
日期:2022-01-01
卷期号:31: 85-98
被引量:5
标识
DOI:10.1109/tip.2021.3125504
摘要
Classifying hard samples in the course of RGBT tracking is a quite challenging problem. Existing methods only focus on enlarging the boundary between positive and negative samples, but ignore the relations of multilevel hard samples, which are crucial for the robustness of hard sample classification. To handle this problem, we propose a novel Multi-Modal Multi-Margin Metric Learning framework named M5L for RGBT tracking. In particular, we divided all samples into four parts including normal positive, normal negative, hard positive and hard negative ones, and aim to leverage their relations to improve the robustness of feature embeddings, e.g., normal positive samples are closer to the ground truth than hard positive ones. To this end, we design a multi-modal multi-margin structural loss to preserve the relations of multilevel hard samples in the training stage. In addition, we introduce an attention-based fusion module to achieve quality-aware integration of different source data. Extensive experiments on large-scale datasets testify that our framework clearly improves the tracking performance and performs favorably the state-of-the-art RGBT trackers.
科研通智能强力驱动
Strongly Powered by AbleSci AI