Label Consistent Matrix Factorization Hashing for Large-Scale Cross-Modal Similarity Search

计算机科学散列函数动态完美哈希判别式成对比较人工智能通用哈希模式识别（心理学）最近邻搜索语义相似性局部敏感散列哈希表相似性（几何）特征哈希机器学习数据挖掘情报检索双重哈希图像（数学）计算机安全

作者

Di Wang,Xinbo Gao,Wang Xiu,Lihuo He

出处

期刊：IEEE Transactions on Pattern Analysis and Machine Intelligence [Institute of Electrical and Electronics Engineers]
日期：2019-10-01 卷期号：41 (10): 2466-2479 被引量：140

链接

nih.govdoi.org

标识

DOI：10.1109/tpami.2018.2861000

摘要

Multimodal hashing has attracted much interest for cross-modal similarity search on large-scale multimedia data sets because of its efficiency and effectiveness. Recently, supervised multimodal hashing, which tries to preserve the semantic information obtained from the labels of training data, has received considerable attention for its higher search accuracy compared with unsupervised multimodal hashing. Although these algorithms are promising, they are mainly designed to preserve pairwise similarities. When semantic labels of training data are given, the algorithms often transform the labels into pairwise similarities, which gives rise to the following problems: (1) constructing pairwise similarity matrix requires enormous storage space and a large amount of calculation, making these methods unscalable to large-scale data sets; (2) transforming labels into pairwise similarities loses the category information of the training data. Therefore, these methods do not enable the hash codes to preserve the discriminative information reflected by labels and, hence, the retrieval accuracies of these methods are affected. To address these challenges, this paper introduces a simple yet effective supervised multimodal hashing method, called label consistent matrix factorization hashing (LCMFH), which focuses on directly utilizing semantic labels to guide the hashing learning procedure. Considering that relevant data from different modalities have semantic correlations, LCMFH transforms heterogeneous data into latent semantic spaces in which multimodal data from the same category share the same representation. Therefore, hash codes quantified by the obtained representations are consistent with the semantic labels of the original data and, thus, can have more discriminative power for cross-modal similarity search tasks. Thorough experiments on standard databases show that the proposed algorithm outperforms several state-of-the-art methods.

求助该文献

最长约 10秒，即可获得该文献文件

Label Consistent Matrix Factorization Hashing for Large-Scale Cross-Modal Similarity Search

今日热心研友