聚类分析
计算机科学
相关聚类
CURE数据聚类算法
单连锁聚类
数据挖掘
图形
稳健性(进化)
树冠聚类算法
数学
算法
模式识别(心理学)
人工智能
理论计算机科学
生物化学
化学
基因
作者
Zhiguo Long,Yang Gao,Hua Meng,Yuqin Yao,Tianrui Li
标识
DOI:10.1016/j.ins.2022.03.091
摘要
Clustering by fast search and find of density peaks (DPC) is a widely used and studied clustering algorithm. In this article, we notice that DPC can achieve highly accurate clustering results when restricted to local neighborhoods. Therefore, by investigating density information in local neighborhoods, we propose to capture latent structures in data with family trees, which can reflect density dominations among nearest neighbors of data. A data set will then be partitioned into multiple family trees. In order to obtain the final clustering result, instead of exploiting the error-prone allocation strategy of DPC, we first elaborately design a novel similarity measure for family trees, characterizing not only the distance between data points, but also the structure of trees. Then, we adapt graph cut for the corresponding connection graph to also take global structural information into account. Extensive experiments on both real-world and synthetic data sets show that the proposed algorithm can outperform several prominent clustering algorithms for most of the cases, including the DPC and spectral clustering algorithms and some of their latest variants. We also analyze the robustness of the proposed algorithm w.r.t. hyper-parameters and its time complexity, as well as the necessity of its components through ablation study.
科研通智能强力驱动
Strongly Powered by AbleSci AI