聚类分析
星团(航天器)
计算机科学
点(几何)
最近邻链算法
数据点
k-中位数聚类
算法
数据挖掘
k-最近邻算法
CURE数据聚类算法
相关聚类
树冠聚类算法
数学
人工智能
几何学
程序设计语言
作者
Neha Aggarwal,Kirti Aggarwal
摘要
Data clustering is an unsupervised classification method aims at creating groups of objects, or clusters, in such a way that objects in the same cluster are very similar and objects in different clusters are quite distinct. K-means is an iterative algorithm in which the number of clusters must be determined before the execution.In this book an efficient k-means algorithm is proposed. Since, in each iteration, the k-means algorithm computes the distances between data point and all centers, this is computationally very expensive especially for huge data sets. For each data point, we can keep the distance to the nearest cluster. At the next iteration, we compute the distance to the previous nearest cluster. If the new distance is less than or equal to the previous distance, the point stays in its cluster, and there is no need to compute its distances to the other cluster centers. This saves the time required to compute distances to k−1 clusters. Experimental results show the accuracy and effectiveness of the proposed method.
科研通智能强力驱动
Strongly Powered by AbleSci AI