聚类分析
主成分分析
k-最近邻算法
维数(图论)
计算机科学
计算
最近邻链算法
模式识别(心理学)
数据挖掘
算法
人工智能
数学
树冠聚类算法
相关聚类
组合数学
作者
Mingjing Du,Shifei Ding,Hongjie Jia
标识
DOI:10.1016/j.knosys.2016.02.001
摘要
Density peaks clustering (DPC) algorithm published in the US journal Science in 2014 is a novel clustering algorithm based on density. It needs neither iterative process nor more parameters. However, original algorithm only has taken into account the global structure of data, which leads to missing many clusters. In addition, DPC does not perform well when data sets have relatively high dimension. Especially, DPC generates wrong number of clusters of real-world data sets. In order to overcome the first problem, we propose a density peaks clustering based on k nearest neighbors (DPC-KNN) which introduces the idea of k nearest neighbors (KNN) into DPC and has another option for the local density computation. In order to overcome the second problem, we introduce principal component analysis (PCA) into the model of DPC-KNN and further bring forward a method based on PCA (DPC-KNN-PCA), which preprocesses high-dimensional data. By experiments on synthetic data sets, we demonstrate the feasibility of our algorithms. By experiments on real-world data sets, we compared this algorithm with k-means algorithm and spectral clustering (SC) algorithm in accuracy. Experimental results show that our algorithms are feasible and effective.
科研通智能强力驱动
Strongly Powered by AbleSci AI