计算机科学
散列函数
成对比较
人工智能
依赖关系(UML)
对象(语法)
模式识别(心理学)
图像检索
特征(语言学)
数据挖掘
特征提取
图像(数学)
利用
目标检测
机器学习
语言学
哲学
计算机安全
作者
Liangkang Peng,Jiangbo Qian,Zhengtao Xu,Xin Yu,Lijun Guo
出处
期刊:IEEE transactions on image processing
[Institute of Electrical and Electronics Engineers]
日期:2023-01-01
卷期号:32: 1759-1773
被引量:3
标识
DOI:10.1109/tip.2023.3251028
摘要
Learning hash functions have been widely applied for large-scale image retrieval. Existing methods usually use CNNs to process an entire image at once, which is efficient for single-label images but not for multi-label images. First, these methods cannot fully exploit independent features of different objects in one image, resulting in some small object features with important information being ignored. Second, the methods cannot capture different semantic information from dependency relations among objects. Third, the existing methods ignore the impacts of imbalance between hard and easy training pairs, resulting in suboptimal hash codes. To address these issues, we propose a novel deep hashing method, termed multi-label hashing for dependency relations among multiple objectives (DRMH). We first utilize an object detection network to extract object feature representations to avoid ignoring small object features and then fuse object visual features with position features and further capture dependency relations among objects using a self-attention mechanism. In addition, we design a weighted pairwise hash loss to solve the imbalance problem between hard and easy training pairs. Extensive experiments are conducted on multi-label datasets and zero-shot datasets, and the proposed DRMH outperforms many state-of-the-art hashing methods with respect to different evaluation metrics.
科研通智能强力驱动
Strongly Powered by AbleSci AI