局部敏感散列
计算机科学
散列函数
序列(生物学)
余弦相似度
相似性(几何)
最近邻搜索
欧几里德距离
数据挖掘
情报检索
比例(比率)
哈希表
算法
模式识别(心理学)
人工智能
物理
生物
图像(数学)
量子力学
遗传学
计算机安全
作者
Shengyingjie Liu,Jianwen Sun,Zhi Liu,Xian Peng,Sannyuya Liu
标识
DOI:10.1145/3033288.3033318
摘要
Locality-sensitive hashing (LSH) considered as an efficient algorithm for large-scale similarity search has become increasingly popular. Recently, many of its variants have been applied widely in high-dimensional similarity search. To overcome the drawback of requirement for a large number of hash tables, researchers proposed the famous Multi-Probe LSH (MP-LSH). It has been used to improve the utilization of hash tables. There are two major probing sequences mentioned in MP-LSH, i.e., Step-Wise Probing (SWP) sequence and Query-Directed Probing (QDP) sequence. It is verified that QDP sequence is better than SWP sequence in number of probes and query time. However, the proposed QDP sequence is based on the E2LSH. It means that the method is only adopted for Euclidean distance. For cosine similarity, SWP sequence is still the only feasible method to perform Multi-Probe LSH.
科研通智能强力驱动
Strongly Powered by AbleSci AI