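Computer science
Event (particle physics)
Key (lock)
Task (project management)
Coding (set theory)
Artificial intelligence
Mechanism (biology)
Recurrent neural network
Machine learning
Class (philosophy)
Human-computer interaction
Artificial neural network
Computer security
Set (abstract data type)
Philosophy
Physics
Management
Epistemology
Quantum mechanics
Economics
Programming language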
Authors
Farzaneh Askari, Rohit Ramaprasad, James J. Clark, Martin D. Levine
Identifier
DOI:10.1109/cvprw56347.2022.00402
Abstract
Interaction recognition from multi-person videos is a challenging yet essential task in computer vision. Such videos often depict actions involving multiple actors, some of whom participate in the main event while the rest are present in the scene without taking part in it. This paper proposes a model to tackle the problem of interaction recognition from multi-person videos. Our model consists of a Recurrent Neural Network (RNN) equipped with a time-varying attention mechanism. It receives scene features and localized actor features to predict the interaction class. Additionally, the attention model identifies the people responsible for the main event. We chose penalty classification from ice hockey broadcast videos as our application. These videos are multi-person and depict complex interactions between players in a non-laboratory recording setup. We evaluate our model on a new dataset of ice hockey penalty videos and report 93.93% classification accuracy. We include a qualitative analysis of the attention mechanism by visualizing the attention weights. Our code is publicly available.
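The abstract gives no equations, so the following is only a minimal sketch of what an RNN with time-varying attention over actors might look like, not the authors' implementation. Everything in it is an assumption made for illustration: the dot-product scoring, the plain tanh recurrence, equal hidden and feature dimensions, the random weights, and all names (`attention_rnn`, `W_h`, `W_x`, `W_out`). At each frame the actors are scored against the current hidden state, the scores are normalized with a softmax, and the weighted actor summary is concatenated with the scene feature before the recurrent update.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def matvec(W, x):
    return [dot(row, x) for row in W]


def attention_rnn(scene_feats, actor_feats, W_h, W_x, W_out):
    """Hypothetical attention RNN (illustrative assumption, not the paper's model).

    scene_feats: T x D list, one scene feature per frame.
    actor_feats: T x N x D list, one feature per actor per frame.
    Assumes the hidden size equals D so actors can be scored by dot product.
    Returns (class probabilities, per-frame attention weights).
    """
    hidden = [0.0] * len(W_h)
    attn_history = []
    for scene, actors in zip(scene_feats, actor_feats):
        # Attention is recomputed every frame from the current hidden state.
        weights = softmax([dot(hidden, a) for a in actors])
        attn_history.append(weights)
        # Attention-weighted sum of the actor features.
        context = [sum(w * a[d] for w, a in zip(weights, actors))
                   for d in range(len(scene))]
        x = scene + context  # concatenate scene and actor summary
        pre = [h + i for h, i in zip(matvec(W_h, hidden), matvec(W_x, x))]
        hidden = [math.tanh(p) for p in pre]
    return softmax(matvec(W_out, hidden)), attn_history


def random_matrix(rows, cols, rng):
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]


# Toy input: 5 frames, 3 actors, 4-dim features, 2 classes (all made up).
rng = random.Random(0)
T, N, D, C = 5, 3, 4, 2
scene_feats = [[rng.uniform(-1, 1) for _ in range(D)] for _ in range(T)]
actor_feats = [[[rng.uniform(-1, 1) for _ in range(D)] for _ in range(N)]
               for _ in range(T)]
W_h = random_matrix(D, D, rng)
W_x = random_matrix(D, 2 * D, rng)
W_out = random_matrix(C, D, rng)

probs, attn = attention_rnn(scene_feats, actor_feats, W_h, W_x, W_out)
```

In this sketch, `attn[t][i]` is the weight given to actor `i` at frame `t`; plotting these weights per frame is one way to inspect which actors the model attends to, in the spirit of the qualitative analysis the abstract mentions. With untrained random weights the values carry no meaning beyond showing the data flow.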