数据库扫描
聚类分析
计算机科学
CURE数据聚类算法
树冠聚类算法
数据挖掘
数据流聚类
相关聚类
模式识别(心理学)
人工智能
确定数据集中的群集数
高维数据聚类
单连锁聚类
算法
出处
期刊:2020 7th International Forum on Electrical Engineering and Automation (IFEEA)
日期:2020-09-01
卷期号:: 949-953
被引量:140
标识
DOI:10.1109/ifeea51475.2020.00199
摘要
Clustering technology has important applications in data mining, pattern recognition, machine learning and other fields. However, with the explosive growth of data, traditional clustering algorithm is more and more difficult to meet the needs of big data analysis. How to improve the traditional clustering algorithm and ensure the quality and efficiency of clustering under the background of big data has become an important research topic of artificial intelligence and big data processing. The density-based clustering algorithm can cluster arbitrarily shaped data sets in the case of unknown data distribution. DBSCAN is a classical density-based clustering algorithm, which is widely used for data clustering analysis due to its simple and efficient characteristics. The purpose of this paper is to study DBSCAN clustering algorithm based on density. This paper first introduces the concept of DBSCAN algorithm, and then carries out performance tests on DBSCAN algorithm in three different data sets. By analyzing the experimental results, it can be concluded that DBSCAN algorithm has higher homogeneity and diversity when it performs personalized clustering on data sets of non-uniform density with broad values and gradually sparse forwards. When the DBSCAN algorithm's neighborhood distance eps is 1000, 26 classes are generated after clustering.
科研通智能强力驱动
Strongly Powered by AbleSci AI