计算机科学
水准点(测量)
跳跃式监视
动作(物理)
帧(网络)
任务(项目管理)
领域(数学分析)
领域(数学)
人工智能
地理
地图学
量子力学
电信
物理
数学分析
经济
管理
纯数学
数学
作者
Zi-Hao Liu,Danfeng Yan,Yuanqiang Cai,Yan Song
标识
DOI:10.1016/j.patcog.2023.110087
摘要
Spatio-temporal action localization is a crucial and challenging task in the field of video understanding. Existing benchmarks for spatio-temporal action detection are limited by factors such as incomplete annotations, high-level non-universal actions, and uncommon scenarios. To address these limitations and facilitate research in real-world security applications, we introduce a novel human-centric dataset for spatio-temporal localization of atomic actions in indoor surveillance settings, termed as HIA (Human-centric Indoor Actions). The HIA dataset is constructed by selecting 30 atomic action classes, compiling 100 surveillance videos, and annotating 219,225 frames with 370,937 bounding boxes. The primary characteristics of HIA include (1) accurate spatio-temporal annotations for atomic actions, (2) human-centric annotations at the frame level, (3) temporal linking of persons across discontinuous tracks, and (4) utilization of indoor surveillance videos. Our HIA, with its realistic settings in indoor surveillance scenes and comprehensive annotations, presents a valuable and novel challenge to the spatio-temporal action localization domain. To establish a benchmark, we evaluate various methods and provide an in-depth analysis of the HIA dataset. The HIA dataset will be made available soon, and we anticipate that it will serve as a standard and practical benchmark for the research community.
科研通智能强力驱动
Strongly Powered by AbleSci AI