聚类分析
计算机科学
层次聚类
选择(遗传算法)
核密度估计
相关聚类
CURE数据聚类算法
单连锁聚类
数据挖掘
确定数据集中的群集数
芯(光纤)
模式识别(心理学)
人工智能
数学
统计
电信
估计员
作者
Zhicheng Shi,Renzhong Guo,Zhigang Zhao
摘要
Abstract Clustering is one of the most prevalent and important data mining algorithms ever developed. Currently, most clustering methods are divided into distance‐based and density‐based. In 2014, the fast search and find of density peaks clustering method was proposed, which is simple and effective and has been extensively applied in several research domains. However, the original version requires manually assigning a cut‐off distance and selecting core points. Therefore, this article improves the density peak clustering method from two aspects. First, the Gaussian kernel is substituted with a k‐nearest neighbors method to calculate local density. This is important as compared with selecting a cut‐off distance, calculating the k ‐value is easier. Second, the core points are automatically selected, unlike the original method that manually selects the core points regarding local density and distance distribution. Given that users' selection influences the clustering result, the proposed automatic core point selection strategy overcomes the human interference problem. Additionally, in the clustering process, the proposed method reduces the influence of manually assigned parameters.
科研通智能强力驱动
Strongly Powered by AbleSci AI