计算机科学
散列函数
人工智能
情态动词
索引(排版)
对偶(语法数字)
情报检索
编码(集合论)
内容(测量理论)
自然语言处理
模式识别(心理学)
数学
艺术
集合(抽象数据类型)
化学
高分子化学
程序设计语言
万维网
数学分析
文学类
计算机安全
作者
Bin Zhang,Yue Zhang,Junyu Li,Jiazhou Chen,Tatsuya Akutsu,Yiu‐ming Cheung,Hongmin Cai
标识
DOI:10.1109/tpami.2024.3467130
摘要
Hashing technology has exhibited great cross-modal retrieval potential due to its appealing retrieval efficiency and storage effectiveness. Most current supervised cross-modal retrieval methods heavily rely on accurate semantic supervision, which is intractable for annotations with ever-growing sample sizes. By comparison, the existing unsupervised methods rely on accurate sample similarity preservation strategies with intensive computational costs to compensate for the lack of semantic guidance, which causes these methods to lose the power to bridge the semantic gap. Furthermore, both kinds of approaches need to search for the nearest samples among all samples in a large search space, whose process is laborious. To address these issues, this paper proposes an unsupervised dual deep hashing (UDDH) method with semantic-index and content-code for cross-modal retrieval. Deep hashing networks are utilized to extract deep features and jointly encode the dual hashing codes in a collaborative manner with a common semantic index and modality content codes to simultaneously bridge the semantic and heterogeneous gaps for cross-modal retrieval. The dual deep hashing architecture, comprising the head code on semantic index and tail codes on modality content, enhances the efficiency for cross-modal retrieval. A query sample only needs to search for the retrieved samples with the same semantic index, thus greatly shrinking the search space and achieving superior retrieval efficiency. UDDH integrates the learning processes of deep feature extraction, binary optimization, common semantic index, and modality content code within a unified model, allowing for collaborative optimization to enhance the overall performance. Extensive experiments are conducted to demonstrate the retrieval superiority of the proposed approach over the state-of-the-art baselines.
科研通智能强力驱动
Strongly Powered by AbleSci AI