计算机科学
人工智能
帧(网络)
探测器
目标检测
任务(项目管理)
计算机视觉
视频跟踪
对象(语法)
过程(计算)
期限(时间)
跟踪(教育)
模式识别(心理学)
深度学习
特征提取
机器学习
卷积神经网络
特征(语言学)
BitTorrent跟踪器
工程类
操作系统
系统工程
物理
心理学
电信
量子力学
教育学
作者
Zdenek Kalal,Krystian Mikolajczyk,Jiri Matas
标识
DOI:10.1109/tpami.2011.239
摘要
This paper investigates long-term tracking of unknown objects in a video stream. The object is defined by its location and extent in a single frame. In every frame that follows, the task is to determine the object's location and extent or indicate that the object is not present. We propose a novel tracking framework (TLD) that explicitly decomposes the long-term tracking task into tracking, learning, and detection. The tracker follows the object from frame to frame. The detector localizes all appearances that have been observed so far and corrects the tracker if necessary. The learning estimates the detector's errors and updates it to avoid these errors in the future. We study how to identify the detector's errors and learn from them. We develop a novel learning method (P-N learning) which estimates the errors by a pair of “experts”: (1) P-expert estimates missed detections, and (2) N-expert estimates false alarms. The learning process is modeled as a discrete dynamical system and the conditions under which the learning guarantees improvement are found. We describe our real-time implementation of the TLD framework and the P-N learning. We carry out an extensive quantitative evaluation which shows a significant improvement over state-of-the-art approaches.
科研通智能强力驱动
Strongly Powered by AbleSci AI