计算机科学
判别式
Boosting(机器学习)
图像检索
搜索引擎索引
量化(信号处理)
数据挖掘
人工智能
情报检索
机器学习
图像(数学)
算法
作者
Eleftherios Spyromitros-Xioufis,Symeon Papadopoulos,Ioannis Kompatsiaris,Grigorios Tsoumakas,Ioannis Vlahavas
标识
DOI:10.1109/tmm.2014.2329648
摘要
This paper deals with content-based large-scale image retrieval using the state-of-the-art framework of VLAD and Product Quantization proposed by Jegou as a starting point. Demonstrating an excellent accuracy-efficiency trade-off, this framework has attracted increased attention from the community and numerous extensions have been proposed. In this work, we make an in-depth analysis of the framework that aims at increasing our understanding of its different processing steps and boosting its overall performance. Our analysis involves the evaluation of numerous extensions (both existing and novel) as well as the study of the effects of several unexplored parameters. We specifically focus on: a) employing more efficient and discriminative local features; b) improving the quality of the aggregated representation; and c) optimizing the indexing scheme. Our thorough experimental evaluation provides new insights into extensions that consistently contribute, and others that do not, to performance improvement, and sheds light onto the effects of previously unexplored parameters of the framework. As a result, we develop an enhanced framework that significantly outperforms the previous best reported accuracy results on standard benchmarks and is more efficient.
科研通智能强力驱动
Strongly Powered by AbleSci AI