聚类分析
计算机科学
CURE数据聚类算法
扩展(谓词逻辑)
公制(单位)
数据挖掘
相关聚类
树冠聚类算法
算法
透视图(图形)
点(几何)
人工智能
机器学习
数学
几何学
运营管理
经济
程序设计语言
作者
Ming-Hao Tung,Yi‐Ping Phoebe Chen,Chen-Yu Liu,Chung-Shou Liao
出处
期刊:IEEE Transactions on Knowledge and Data Engineering
[Institute of Electrical and Electronics Engineers]
日期:2022-01-01
卷期号:: 1-1
被引量:1
标识
DOI:10.1109/tkde.2022.3161117
摘要
Clustering algorithms have been widely studied in many scientific areas, such as data mining, knowledge discovery, bioinformatics and machine learning. A density-based clustering algorithm, called density peaks (DP), which was proposed by Rodriguez and Laio, outperform almost all other approaches. Although the DP algorithm performs well in many cases, there is still room for improvement in the precision of its output clusters as well as the quality of the selected centers. In this study, we propose a more accurate clustering algorithm, seed-and-extension-based density peaks (SDP). SDP selects the centers that hold the features of their clusters while building a spanning forest, and meanwhile, constructs the output clusters in a seed-and-extension manner. Experiment results demonstrate the effectiveness of SDP, especially when dealing with clusters with relatively high densities. Precisely, we show that SDP is more accurate than the DP algorithm as well as other state-of-the-art clustering approaches concerning the quality of both output clusters and cluster centers while maintaining similar running time of the DP algorithm, particularly for a variety of time-series (i.e. non-metric) data. Moreover, SDP outperforms DP in the dynamic model in which data point insertion and deletion are allowed. From a practical perspective, the proposed SDP algorithm is obviously helpful to many application problems.
科研通智能强力驱动
Strongly Powered by AbleSci AI