素描
计算机科学
图像翻译
水准点(测量)
领域(数学分析)
图像(数学)
人工智能
图像检索
空格(标点符号)
翻译(生物学)
模式识别(心理学)
计算机视觉
情报检索
算法
数学
数学分析
操作系统
信使核糖核酸
化学
基因
地理
生物化学
大地测量学
作者
Jiangtong Li,Zhixin Ling,Li Niu,Liqing Zhang
标识
DOI:10.1016/j.cviu.2022.103412
摘要
The goal of Sketch-Based Image Retrieval (SBIR) is using free-hand sketches to retrieve images of the same category from a natural image gallery. However, SBIR requires all test categories to be seen during training, which cannot be guaranteed in real-world applications. So we investigate more challenging Zero-Shot SBIR (ZS-SBIR), in which test categories do not appear in the training stage. After realizing that sketches mainly contain structure information while images contain additional appearance information, we attempt to achieve structure-aware retrieval via asymmetric disentanglement. For this purpose, we propose our STRucture-aware Asymmetric Disentanglement (STRAD) method, in which image features are disentangled into structure features and appearance features while sketch features are only projected to structure space. Through disentangling structure and appearance space, bi-directional domain translation is performed between the sketch domain and the image domain. Extensive experiments demonstrate that our STRAD method remarkably outperforms state-of-the-art methods on three large-scale benchmark datasets.
科研通智能强力驱动
Strongly Powered by AbleSci AI