计算机科学
散列函数
情报检索
双重哈希
二进制代码
情态动词
哈希表
通用哈希
二进制数
数学
计算机安全
算术
化学
高分子化学
作者
Lei Zhu,C. Zheng,Weili Guan,Jingjing Li,Yang Yang,Heng Tao Shen
出处
期刊:IEEE Transactions on Knowledge and Data Engineering
[Institute of Electrical and Electronics Engineers]
日期:2023-06-05
卷期号:36 (1): 239-260
被引量:30
标识
DOI:10.1109/tkde.2023.3282921
摘要
With the explosive growth of multimedia contents, multimedia retrieval is facing unprecedented challenges on both storage cost and retrieval speed. Hashing technique can project the high-dimensional data into compact binary hash codes. With it, the most time-consuming semantic similarity computation during the multimedia retrieval process can be significantly accelerated with fast Hamming distance computation, and meanwhile the storage cost can be reduced greatly by the binary embedding. In the light of this, multi-modal hashing has recently received considerable attention to support large-scale multimedia retrieval. Different from uni-modal hashing, the multi-modal hashing focuses on modeling the multi-modal semantics and further preserving them into binary hash codes with hash learning. In this paper, we first systematically review the existing learning to hash methods for efficient multimedia retrieval, categorizing them according to the multimedia retrieval tasks, the specific multi-modal semantic modeling techniques, and hash learning strategies. Thereafter, we present the performance comparison results. We ultimately discuss the challenges and potential research directions that may require further investigation in multi-modal hash learning. To facilitate the research on multi-modal hashing, we develop an open-source performance comparison tool at https://github.com/BMC-SDNU/Hashing-Retrieval .
科研通智能强力驱动
Strongly Powered by AbleSci AI