聚类分析
计算
维数之咒
比例(比率)
计算机科学
算法
简单(哲学)
k-最近邻算法
数学
人工智能
物理
量子力学
认识论
哲学
作者
Yewang Chen,Xiaoliang Hu,Wentao Fan,Shen Lianlian,Zheng Zhang,Xin Liu,Ji‐Xiang Du,Haibo Li,Yi Chen,Hailin Li
标识
DOI:10.1016/j.knosys.2019.06.032
摘要
Density Peak (DPeak) clustering algorithm is not applicable for large scale data, due to two quantities, i.e, ρ and δ, are both obtained by brute force algorithm with complexity O(n2). Thus, a simple but fast DPeak, namely FastDPeak,1 is proposed, which runs in about O(nlog(n)) expected time in the intrinsic dimensionality. It replaces density with kNN-density, which is computed by fast kNN algorithm such as cover tree, yielding huge improvement for density computations. Based on kNN-density, local density peaks and non-local density peaks are identified, and a fast algorithm, which uses two different strategies to compute δ for them, is also proposed with complexity O(n). Experimental results show that FastDPeak is effective and outperforms other variants of DPeak.
科研通智能强力驱动
Strongly Powered by AbleSci AI