计算机科学
人工智能
任务(项目管理)
背景(考古学)
突出
集合(抽象数据类型)
依赖关系(UML)
视觉推理
动作(物理)
情报检索
机器学习
自然语言处理
古生物学
物理
管理
量子力学
经济
生物
程序设计语言
作者
Thilini Cooray,Ngai‐Man Cheung,Wei Lu
标识
DOI:10.1109/cvpr42600.2020.00479
摘要
Situation Recognition (SR) is a fine-grained action recognition task where the model is expected to not only predict the salient action of the image, but also predict values of all associated semantic roles of the action. Predicting semantic roles is very challenging: a vast variety of possibilities can be the match for a semantic role. Existing work has focused on dependency modelling architectures to solve this issue. Inspired by the success achieved by query-based visual reasoning (e.g., Visual Question Answering), we propose to address semantic role prediction as a query-based visual reasoning problem. However, existing query-based reasoning methods have not considered handling of inter-dependent queries which is a unique requirement of semantic role prediction in SR. Therefore, to the best of our knowledge, we propose the first set of methods to address inter-dependent queries in query-based visual reasoning. Extensive experiments demonstrate the effectiveness of our proposed method which achieves outstanding performance on Situation Recognition task. Furthermore, leveraging query inter-dependency, our methods improve upon a state-of-the-art method that answers queries separately. Our code: https://github.com/thilinicooray/context-aware-reasoning-for-sr.
科研通智能强力驱动
Strongly Powered by AbleSci AI