轮廓
聚类分析
相关聚类
相似性(几何)
CURE数据聚类算法
数学
模式识别(心理学)
模糊聚类
星团(航天器)
确定数据集中的群集数
计算机科学
k-中位数聚类
单连锁聚类
人工智能
数据挖掘
图像(数学)
程序设计语言
作者
Adil M. Bagirov,Ramiz M. Alıguliyev,Nargiz Sultanova
标识
DOI:10.1016/j.patcog.2022.109144
摘要
Finding compact and well-separated clusters in data sets is a challenging task. Most clustering algorithms try to minimize certain clustering objective functions. These functions usually reflect the intra-cluster similarity and inter-cluster dissimilarity. However, the use of such functions alone may not lead to the finding of well-separated and, in some cases, compact clusters. Therefore additional measures, called cluster validity indices, are used to estimate the true number of well-separated and compact clusters. Some of these indices are well-suited to be included into the optimization model of the clustering problem. Silhouette coefficients are among such indices. In this paper, a new optimization model of the clustering problem is developed where the clustering function is used as an objective and silhouette coefficients are used to formulate constraints. Then an algorithm, called CLUSCO (CLustering Using Silhouette COefficients), is designed to construct clusters incrementally. Three schemes are discussed to reduce the computational complexity of the algorithm. Its performance is evaluated using fourteen real-world data sets and compared with that of three state-of-the-art clustering algorithms. Results show that the CLUSCO is able to compute compact clusters which are significantly better separable in comparison with those obtained by other algorithms.
科研通智能强力驱动
Strongly Powered by AbleSci AI