计算机科学
人工智能
多标签分类
图形
模式识别(心理学)
集合(抽象数据类型)
图像(数学)
相似性(几何)
上下文图像分类
机器学习
理论计算机科学
程序设计语言
作者
Xuelin Zhu,Jian K. Liu,Weijia Liu,Jiawei Ge,Bo Liu,Jiuxin Cao
标识
DOI:10.1109/iccv51070.2023.00142
摘要
Multi-label image classification refers to assigning a set of labels for an image. One of the main challenges of this task is how to effectively capture the correlation among labels. Existing studies on this issue mostly rely on the statistical label co-occurrence or semantic similarity of labels. However, an important fact is ignored that the co-occurrence of labels is closely related with image scenes (indoor, outdoor, etc.), which is a vital characteristic in multi-label image classification. In this paper, a novel scene-aware label graph learning framework is proposed, which is capable of learning visual representations for labels while fully perceiving their co-occurrence relationships under variable scenes. Specifically, our framework is able to detect scene categories of images without relying on manual annotations, and keeps track of the co-occurring labels by maintaining a global co-occurrence matrix for each scene category throughout the whole training phase. These scene-independent co-occurrence matrices are further employed to guide the interactions among label representations in a graph propagation manner towards accurate label prediction. Extensive experiments on public benchmarks demonstrate the superiority of our framework.
科研通智能强力驱动
Strongly Powered by AbleSci AI