计算机科学
水准点(测量)
人工智能
杂乱
BitTorrent跟踪器
无人机
判别式
最小边界框
特征提取
边距(机器学习)
跟踪(教育)
计算机视觉
眼动
模式识别(心理学)
机器学习
雷达
地理
生物
图像(数学)
电信
遗传学
教育学
心理学
大地测量学
作者
Bo Huang,Jianan Li,Junjie Chen,Gang Wang,Jian Zhao,Tingfa Xu
标识
DOI:10.1109/tpami.2023.3335338
摘要
The perception of drones, also known as Unmanned Aerial Vehicles (UAVs), particularly in infrared videos, is crucial for effective anti-UAV tasks. However, existing datasets for UAV tracking have limitations in terms of target size and attribute distribution characteristics, which do not fully represent complex realistic scenes. To address this issue, we introduce a generalized infrared UAV tracking benchmark called Anti-UAV410. The benchmark comprises a total of 410 videos with over 438 K manually annotated bounding boxes. To tackle the challenges of UAV tracking in complex environments, we propose a novel method called Siamese drone tracker (SiamDT). SiamDT incorporates a dual-semantic feature extraction mechanism that explicitly models targets in dynamic background clutter, enabling effective tracking of small UAVs. The SiamDT method consists of three key steps: Dual-Semantic RPN Proposals (DS-RPN), Versatile R-CNN (VR-CNN), and Background Distractors Suppression. These steps are responsible for generating candidate proposals, refining prediction scores based on dual-semantic features, and enhancing the discriminative capacity of the trackers against dynamic background clutter, respectively. Extensive experiments conducted on the Anti-UAV410 dataset and three other large-scale benchmarks demonstrate the superior performance of the proposed SiamDT method compared to recent state-of-the-art trackers.
科研通智能强力驱动
Strongly Powered by AbleSci AI