计算机科学
公制(单位)
串联(数学)
散列函数
边距(机器学习)
代表(政治)
人工智能
保险丝(电气)
深度学习
情报检索
机器学习
数据挖掘
数学
运营管理
计算机安全
电气工程
组合数学
政治
政治学
法学
经济
工程类
作者
Jianguo Zhu,Xiaohu Ruan,Yongli Cheng,Zhangmin Huang,Yu Cui,Lingfang Zeng
标识
DOI:10.1109/icme55011.2023.00335
摘要
Learning the hash representation of multi-view heterogeneous data is an important task in multimedia retrieval. However, existing methods fail to effectively fuse the multi-view features and utilize the metric information provided by the dissimilar samples, leading to limited retrieval precision. Current methods utilize weighted sum or concatenation to fuse the multi-view features. We argue that these fusion methods cannot capture the interaction among different views. Furthermore, these methods ignored the information provided by the dissimilar samples. We propose a novel deep metric multi-view hashing (DMMVH) method to address the mentioned problems. Extensive empirical evidence is presented to show that gate-based fusion is better than typical methods. We introduce deep metric learning to the multi-view hashing problems, which can utilize metric information of dissimilar samples. On the MIR-Flickr25K, MS COCO, and NUS-WIDE, our method outperforms the current state-of-the-art methods by a large margin (up to 15.28 mean Average Precision (mAP) improvement).
科研通智能强力驱动
Strongly Powered by AbleSci AI