计算机科学
图像检索
最近邻搜索
局部敏感散列
k-最近邻算法
散列函数
最佳垃圾箱优先
图像(数学)
情报检索
秩(图论)
模式识别(心理学)
哈希表
数据挖掘
人工智能
数学
计算机安全
组合数学
作者
Vinicius Kawai,Lucas Pascotti Valem,Alexandro Baldassin,Edson Borin,Daniel Carlos Guimarães Pedronette,Longin Jan Latecki
摘要
The large and growing amount of digital data creates a pressing need for approaches capable of indexing and retrieving multimedia content. A traditional and fundamental challenge consists of effectively and efficiently performing nearest-neighbor searches. After decades of research, several different methods are available, including trees, hashing, and graph-based approaches. Most of the current methods exploit learning to hash approaches based on deep learning. In spite of effective results and compact codes obtained, such methods often require a significant amount of labeled data for training. Unsupervised approaches also rely on expensive training procedures usually based on a huge amount of data. In this work, we propose an unsupervised data-independent approach for nearest neighbor searches, which can be used with different features, including deep features trained by transfer learning. The method uses a rank-based formulation and exploits a hashing approach for efficient ranked list computation at query time. A comprehensive experimental evaluation was conducted on seven public datasets, considering deep features based on CNNs and Transformers. Both effectiveness and efficiency aspects were evaluated. The proposed approach achieves remarkable results in comparison to traditional and state-of-the-art methods. Hence, it is an attractive and innovative solution, especially when costly training procedures need to be avoided.
科研通智能强力驱动
Strongly Powered by AbleSci AI