Knowledge-Guided Multi-Label Few-Shot Learning for General Image Recognition

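Computer science; Artificial intelligence; Exploit; Multi-label classification; Classifier (UML); Graph; Semantics (computer science); Knowledge graph; Pattern recognition (psychology); Artificial neural network; Dependency graph; Machine learning; Theoretical computer science; Computer security; Programming language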

Authors

Tianshui Chen,Liang Lin,Riquan Chen,Xiaolu Hui,Hefeng Wu

Source

Journal: IEEE Transactions on Pattern Analysis and Machine Intelligence [Institute of Electrical and Electronics Engineers]
Date: 2020-09-30 Volume (issue): 44 (3): 1371-1384 Citations: 113

Links

arxiv.org, nih.gov, doi.org

Identifiers

DOI: 10.1109/tpami.2020.3025814

Abstract

Recognizing multiple labels of an image is a practical yet challenging task, and remarkable progress has been achieved by searching for semantic regions and exploiting label dependencies. However, current works utilize RNN/LSTM to implicitly capture sequential region/label dependencies, which cannot fully explore mutual interactions among the semantic regions/labels and do not explicitly integrate label co-occurrences. In addition, these works require large amounts of training samples for each category, and they are unable to generalize to novel categories with limited samples. To address these issues, we propose a knowledge-guided graph routing (KGGR) framework, which unifies prior knowledge of statistical label correlations with deep neural networks. The framework exploits prior knowledge to guide adaptive information propagation among different categories to facilitate multi-label analysis and reduce the dependency of training samples. Specifically, it first builds a structured knowledge graph to correlate different labels based on statistical label co-occurrence. Then, it introduces the label semantics to guide learning semantic-specific features to initialize the graph, and it exploits a graph propagation network to explore graph node interactions, enabling learning contextualized image feature representations. Moreover, we initialize each graph node with the classifier weights for the corresponding label and apply another propagation network to transfer node messages through the graph. In this way, it can facilitate exploiting the information of correlated labels to help train better classifiers, especially for labels with limited training samples. We conduct extensive experiments on the traditional multi-label image recognition (MLR) and multi-label few-shot learning (ML-FSL) tasks and show that our KGGR framework outperforms the current state-of-the-art methods by sizable margins on the public benchmarks.
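
The two ideas at the core of the abstract, a label graph built from statistical co-occurrence and message passing over that graph, can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the abstract does not say how co-occurrence is turned into edge weights, so the sketch uses the conditional frequency P(label j | label i) estimated from a binary annotation matrix, and it replaces the paper's learned propagation network with a single fixed linear mixing step. The names `cooccurrence_graph`, `propagate`, and `mix` are hypothetical.

```python
from typing import List

Matrix = List[List[float]]


def cooccurrence_graph(labels: List[List[int]]) -> Matrix:
    """Estimate adj[i][j] = P(label j present | label i present).

    `labels` is a binary matrix with one row per image and one column per
    label. This conditional-frequency estimate is an assumed choice.
    """
    num_labels = len(labels[0])
    counts = [[0] * num_labels for _ in range(num_labels)]
    for row in labels:
        present = [c for c, flag in enumerate(row) if flag]
        for i in present:
            for j in present:
                counts[i][j] += 1  # counts[i][i] ends up as occurrences of i

    adj = [[0.0] * num_labels for _ in range(num_labels)]
    for i in range(num_labels):
        total = counts[i][i]
        if total == 0:
            continue  # label never observed: leave its row at zero
        for j in range(num_labels):
            if i != j:  # no self-loops
                adj[i][j] = counts[i][j] / total
    return adj


def propagate(node_feats: Matrix, adj: Matrix, mix: float = 0.5) -> Matrix:
    """One simplified propagation step (not the paper's learned network).

    Each node keeps (1 - mix) of its own feature and adds `mix` times the
    edge-weighted sum of the other nodes' features.
    """
    num_nodes = len(node_feats)
    dim = len(node_feats[0])
    updated = []
    for i in range(num_nodes):
        message = [
            sum(adj[i][j] * node_feats[j][d] for j in range(num_nodes))
            for d in range(dim)
        ]
        updated.append(
            [(1 - mix) * node_feats[i][d] + mix * message[d] for d in range(dim)]
        )
    return updated


# Toy annotations: 4 images, 3 labels.
labels = [
    [1, 1, 0],
    [1, 0, 0],
    [0, 1, 1],
    [1, 1, 0],
]
adj = cooccurrence_graph(labels)

# One-hot node features stand in for per-label features or classifier weights.
feats = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
out = propagate(feats, adj)
```

In the toy data, label 2 appears only once and always alongside label 1, so its node receives a strong message from label 1. This mirrors the intuition in the abstract that labels with few training samples can borrow information from correlated labels.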
