计算机科学
人工智能
图形
推论
背景(考古学)
模式识别(心理学)
机器学习
场景图
理论计算机科学
作者
Ning Xu,An-An Liu,Yongkang Wong,Weizhi Nie,Yuting Su,Mohan S. Kankanhalli
出处
期刊:IEEE Transactions on Circuits and Systems for Video Technology
[Institute of Electrical and Electronics Engineers]
日期:2021-03-01
卷期号:31 (3): 1031-1041
被引量:5
标识
DOI:10.1109/tcsvt.2020.2990989
摘要
The scene graph generated for an image structurally represents its object interactions and it substantially aids image scene understanding. To the best of our knowledge, most current works on scene graph generation chiefly focus on pairwise object regions for object and relation inference while ignoring the global visual context outside of these regions. Guided by the intuition that object/relation inference can benefit from the visual context within an image, this paper proposes a multi-scale context modeling method, which can jointly discover and integrate the complementary object-centric and region-centric context for scene graph inference. While both the object-centric and region-centric contexts are separately modeled by their individual modules, a bi-directional message propagation strategy is designed to mutually reinforce the context modeling. A context-fused inference is then proposed to integrate the multi-scale context to guide scene graph inference. Extensive experiments establish that this method can achieve competitive performance compared to the state-of-the-art methods on three benchmarks. Additional ablation studies further validate its effectiveness. Code has been made available at: https://github.com/ningxu1990/MSCM .
科研通智能强力驱动
Strongly Powered by AbleSci AI