计算机科学
BitTorrent跟踪器
人工智能
匹配(统计)
特征(语言学)
跟踪(教育)
计算机视觉
弹道
模式识别(心理学)
视频跟踪
粒度
对象(语法)
眼动
数学
心理学
教育学
语言学
统计
哲学
物理
天文
操作系统
作者
Hong Zhang,Wanli Xing,Yifan Yang,Yan Li,Ding Yuan
标识
DOI:10.1016/j.ins.2023.03.083
摘要
Recently, Siamese trackers have achieved remarkable tracking performance. However, challenges such as accurate feature representation of targets with different spatial regions and the utilization of diverse target temporal states still need to be addressed. Here, we proposed a Siamese network with spatio-temporal awareness called SiamST. The standard square convolutional kernels and the single feature matching operation hardly represent the targets with different shapes accurately. Therefore, we designed a refined region fusion module that combines multiple convolutional kernels to fit targets with different aspect ratios. Furthermore, we proposed a multi-granularity matching module to obtain more robust feature matching results by combining fine-grained and coarse-grained matching results. However, most existing Siamese trackers do not adequately employ target temporal states. They usually only update the templates, which automatically causes motion information loss. Therefore, we built dynamic templates by screening high-quality samples to describe the target appearance changes accurately. In addition, we designed a trend guidance module to adjust the location prior constraint appropriately to match the tracking results to the target's motion trajectory. Extensive experimental results on eight tracking benchmarks demonstrate the competitive performance of SiamST compared to many advanced trackers.
科研通智能强力驱动
Strongly Powered by AbleSci AI