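Cluster analysis
CURE data clustering algorithm
Data mining
Computer science
Correlation clustering
Data stream clustering
Single-linkage clustering
Fuzzy clustering
Canopy clustering algorithm
Determining the number of clusters in a data set
Algorithm
Grid
k-medians clustering
FLAME clustering
Mathematics
Artificial intelligence
Geometry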
Authors
Fengying Ma, Cheng Wang, Jian Huang, Qiuping Zhong, Tengfei Zhang
Identifier
DOI:10.1016/j.ins.2024.120109
Abstract
In networked environments, data from many industries is dynamic and large-scale. Traditional clustering algorithms struggle to reuse existing clustering results when the data evolves continuously, which has drawn considerable attention to incremental grid-based clustering. However, existing incremental grid-based clustering algorithms do not adequately account for the impact of newly added data on the original cluster structure. To address this issue, a key-grid-based batch-incremental CLIQUE clustering algorithm is proposed. The algorithm designates as key grids those grids onto which incremental data is mapped and which, or whose neighbouring grids, also contain original data, so that the changes in cluster structure caused by the incremental data are fully considered. Moreover, a cluster similarity coefficient based on grid features is introduced to measure density differences between the incremental data and the original clusters, and a cluster membership degree is defined to determine the cluster membership of data in sparse boundary grids and to identify noise points. Together, these mechanisms allow the algorithm to adaptively create, merge, or split clusters as new data arrives. Experimental results show that the proposed algorithm adaptively adjusts the cluster structure during incremental clustering, achieving higher accuracy and efficiency when clustering large-scale, dynamically changing data.
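The key-grid selection described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (`to_cell`, `face_neighbours`, `find_key_grids`), the uniform `cell_width`, and the choice of face-adjacent cells as "neighbours" are all assumptions made for illustration. The similarity coefficient and membership degree are not sketched, since the abstract does not define them.

```python
import math
from typing import Iterable, Sequence, Set, Tuple

Cell = Tuple[int, ...]  # integer grid index, one entry per dimension


def to_cell(point: Sequence[float], cell_width: float) -> Cell:
    """Map a point to the index of the grid cell that contains it.

    Assumes a uniform grid with the same (hypothetical) cell_width on every axis.
    """
    return tuple(math.floor(x / cell_width) for x in point)


def face_neighbours(cell: Cell) -> Iterable[Cell]:
    """Yield cells that differ from `cell` by one step along exactly one axis.

    Assumption: "neighbour" means face-adjacent; the paper may define it differently.
    """
    for axis in range(len(cell)):
        for step in (-1, 1):
            yield cell[:axis] + (cell[axis] + step,) + cell[axis + 1:]


def find_key_grids(
    original: Iterable[Sequence[float]],
    incremental: Iterable[Sequence[float]],
    cell_width: float,
) -> Set[Cell]:
    """Return cells hit by incremental data that contain, or border, original data."""
    original_cells = {to_cell(p, cell_width) for p in original}
    incremental_cells = {to_cell(p, cell_width) for p in incremental}
    return {
        cell
        for cell in incremental_cells
        if cell in original_cells
        or any(n in original_cells for n in face_neighbours(cell))
    }


# Toy data: the original points all fall in cell (0, 0).
original_points = [(0.2, 0.3), (0.7, 0.1)]
# Incremental points fall in cells (0, 0), (1, 0) and the isolated cell (5, 5).
incremental_points = [(0.5, 0.5), (1.4, 0.2), (5.1, 5.2)]

key_grids = find_key_grids(original_points, incremental_points, cell_width=1.0)
print(sorted(key_grids))  # [(0, 0), (1, 0)]
```

In this toy example the cell (5, 5) is not a key grid because neither it nor any of its face-adjacent cells holds original data; under the abstract's description, only the key grids would need their cluster structure re-examined.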