聚类分析
计算机科学
水准点(测量)
噪音(视频)
数据挖掘
数据库扫描
数据点
模式识别(心理学)
算法
数学
人工智能
相关聚类
CURE数据聚类算法
大地测量学
图像(数学)
地理
作者
Shahin Pourbahrami,Mahdi Hashemzadeh
标识
DOI:10.1016/j.ins.2022.08.047
摘要
Neighborhood-based and density-based clustering methods are applied in various data analysis applications. However, most of them have low performance in mixed heterogeneous datasets, and high computational cost. To mitigate these challenges, in this paper, a geometric-based clustering method is proposed which uses the concept of natural neighborhoods to extract the local density of data points. At the first stage of the algorithm, primary clusters are formed by identifying dense points that have a higher number of natural neighbors. Then, the heads of the clusters are extracted in dense points, based on the local natural density criterion. Afterward, a novel natural neighborhood-based method is applied to identify the points having a low number of natural neighbors, which called weak or noise data points. Finally, the desired clusters are obtained by ignoring the weak/noise points. The computational complexity of the proposed method is O(nlogn). The proposed method is independent of any specific parameter setting. The results of experiments on benchmark datasets confirm that it achieves a higher performance compared to the competitors.
科研通智能强力驱动
Strongly Powered by AbleSci AI