计算机科学
判别式
视觉文字
图形
图像自动标注
图像检索
卷积神经网络
语义学(计算机科学)
人工智能
模式识别(心理学)
图像(数学)
理论计算机科学
程序设计语言
作者
Ying Li,Hongwei Zhou,Yeyu Yin,Jiaquan Gao
出处
期刊:ACM Multimedia
日期:2021-10-17
被引量:8
标识
DOI:10.1145/3474085.3475695
摘要
Pattern images are artificially designed images which are discriminative in aspects of elements, styles, arrangements and so on. Pattern images are widely used in fields like textile, clothing, art, fashion and graphic design. With the growth of image numbers, pattern image retrieval has great potential in commercial applications and industrial production. However, most of existing content-based image retrieval works mainly focus on describing simple attributes with clear conceptual boundaries, which are not suitable for pattern image retrieval. It is difficult to accurately represent and retrieve pattern images which include complex details and multiple elements. Therefore, in this paper, we collect a new pattern image dataset with multiple labels per image for the pattern image retrieval task. To extract discriminative semantic features of multi-label pattern images and construct high-level topology relationships between features, we further propose an Attention Mechanism Driven Graph Convolutional Network (AMD-GCN). Different layers of the multi-semantic attention module activate regions of interest corresponding to multiple labels, respectively. By embedding the learned labels from attention module into the graph convolutional network, which can capture the dependency of labels on the graph manifold, the AMD-GCN builds an end-to-end framework to extract high-level semantic features with label semantics and inner relationships for retrieval. Experiments on the pattern image dataset show that the proposed method highlights the relevant semantic regions of multiple labels, and achieves higher accuracy than state-of-the-art image retrieval methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI