随机投影
最近邻搜索
k-最近邻算法
计算机科学
随机图
投影(关系代数)
图形
随机森林
稀疏矩阵
距离矩阵
最近邻图
模式识别(心理学)
算法
理论计算机科学
人工智能
量子力学
物理
高斯分布
作者
Isuru Ranawaka,Md. Khaledur Rahman,Ariful Azad
标识
DOI:10.1109/ipdps54959.2023.00014
摘要
A random projection tree that partitions data points by projecting them onto random vectors is widely used for approximate nearest neighbor search in high-dimensional space. We consider a particular case of random projection trees for constructing a k-nearest neighbor graph (KNNG) from high-dimensional data. We develop a distributed-memory Random Projection Tree (DRPT) algorithm for constructing sparse random projection trees and then running a query on the forest to create the KNN graph. DRPT uses sparse matrix operations and a communication reduction scheme to scale KNN graph constructions to thousands of processes on a supercomputer. The accuracy of DRPT is comparable to state-of-the-art methods for approximate nearest neighbor search, while it runs two orders of magnitude faster than its peers. DRPT is available at https://github.com/HipGraph/DRPT.
科研通智能强力驱动
Strongly Powered by AbleSci AI