聚类分析
相关聚类
CURE数据聚类算法
模糊聚类
光谱聚类
单连锁聚类
数据流聚类
高维数据聚类
计算机科学
相似性(几何)
数据挖掘
约束聚类
共识聚类
模式识别(心理学)
人工智能
k-中位数聚类
图像(数学)
作者
Feiping Nie,Xiaoqian Wang,Heng Huang
出处
期刊:Knowledge Discovery and Data Mining
日期:2014-08-24
被引量:594
标识
DOI:10.1145/2623330.2623726
摘要
Many clustering methods partition the data groups based on the input data similarity matrix. Thus, the clustering results highly depend on the data similarity learning. Because the similarity measurement and data clustering are often conducted in two separated steps, the learned data similarity may not be the optimal one for data clustering and lead to the suboptimal results. In this paper, we propose a novel clustering model to learn the data similarity matrix and clustering structure simultaneously. Our new model learns the data similarity matrix by assigning the adaptive and optimal neighbors for each data point based on the local distances. Meanwhile, the new rank constraint is imposed to the Laplacian matrix of the data similarity matrix, such that the connected components in the resulted similarity matrix are exactly equal to the cluster number. We derive an efficient algorithm to optimize the proposed challenging problem, and show the theoretical analysis on the connections between our method and the K-means clustering, and spectral clustering. We also further extend the new clustering model for the projected clustering to handle the high-dimensional data. Extensive empirical results on both synthetic data and real-world benchmark data sets show that our new clustering methods consistently outperforms the related clustering approaches.
科研通智能强力驱动
Strongly Powered by AbleSci AI