模式识别(心理学)
计算机科学
散列函数
相似性(几何)
汉明空间
人工智能
聚类分析
特征(语言学)
局部敏感散列
汉明距离
语义相似性
图像检索
一致性(知识库)
数学
哈希表
汉明码
图像(数学)
算法
解码方法
语言学
哲学
计算机安全
区块代码
作者
Chunyan Xie,Gao Yunmei,Zhou Qiyao,Jing Zhou
标识
DOI:10.1016/j.ins.2023.119543
摘要
In unsupervised cross-modal hashing, there are two notable issues that require attention. The inter- and intra-modal similarity matrices in the original and Hamming spaces lack sufficient neighborhood information and semantic consistency, while solely relying on the reconstruction of instance-level similarity matrices fails to effectively capture the global intrinsic correlation and manifold structure of the training samples. We propose a novel method that combines multi-similarity reconstructing with clustering-based contrastive hashing. Firstly, we construct image feature, text feature and joint-semantic feature multi-similarity matrices in their original space, along with their corresponding hashing code similarity matrices in the Hamming space, to enhance the semantic consistency of the inter-and intra-modal reconstructions. Secondly, the clustering-based contrastive hashing is proposed to capture the global intrinsic correlation and manifold structure of the image-text pairs. Extensive experiment results on Wiki, NUS-WIDE, MIRFlickr-25K and MS-COCO demonstrate the promising cross-modal retrieval performance of the proposed method.
科研通智能强力驱动
Strongly Powered by AbleSci AI