High-density cluster core-based k-means clustering with an unknown number of clusters

Cluster analysis, Computer science, Artificial intelligence
Authors
Abhimanyu Kumar, Abhishek Kumar, Rammohan Mallipeddi, Dong-Gyu Lee
Source
Journal: Applied Soft Computing [Elsevier BV]
Volume/Issue: : 111419-111419 Cited by: 2
Identifier
DOI: 10.1016/j.asoc.2024.111419
Abstract

The k-means algorithm, known for its simplicity and adaptability, faces challenges related to manual cluster number selection and sensitivity to initial centroid placement. This paper introduces an innovative framework aimed at overcoming these challenges. By proposing a data-driven cluster number estimation method and a robust initialization strategy based on high-density cluster cores, our approach revolutionizes k-means, unlocking its full unsupervised potential and ensuring superior performance, even in scenarios involving overlapping clusters. The method employs a novel density-based technique to accurately identify cluster cores, resulting in substantial improvements over existing methods. Rigorous experimentation on synthetic and real-world datasets demonstrates an average performance enhancement of 15% in terms of the Adjusted Rand Index for datasets with overlapping clusters, surpassing the capabilities of state-of-the-art density-based clustering methods and traditional k-means. Moreover, our method autonomously determines the optimal number of clusters, facilitating true unsupervised learning and eliminating the impact of initial centroid placement on clustering outcomes. This leads to stable and consistent results, addressing key limitations of the conventional k-means algorithm. The practical applicability of our approach is exemplified in image segmentation tasks, showcasing its versatility and reliability in real-world scenarios.
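
The abstract does not spell out how the cluster cores are found, so the following is only a minimal sketch of the general idea it describes: estimate the number of clusters from high-density points, then use those points to initialize k-means instead of random centroids. It is not the authors' algorithm. A simple k-nearest-neighbour density estimate with greedy peak selection stands in for their density-based technique, and the names `find_cores`, `kmeans`, `k`, and `min_sep` are hypothetical choices made for illustration. The sketch also assumes the data contain no outliers, since it applies no density threshold.

```python
import numpy as np


def find_cores(X, k=5, min_sep=1.0):
    """Pick high-density points that lie at least `min_sep` apart.

    Assumption: this greedy density-peak selection is a stand-in, not the
    paper's method. The number of cores returned is the estimated number
    of clusters. Requires k < len(X).
    """
    # Pairwise Euclidean distances, shape (n, n).
    dist = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=2)
    # Distance to the k-th nearest neighbour; column 0 of each sorted row
    # is the point's zero distance to itself.
    kth = np.sort(dist, axis=1)[:, k]
    density = 1.0 / (kth + 1e-12)

    cores = []
    # Visit points from densest to sparsest and keep a point only if it is
    # far enough from every core accepted so far.
    for i in np.argsort(-density):
        if all(dist[i, j] >= min_sep for j in cores):
            cores.append(i)
    return X[cores]


def kmeans(X, centroids, n_iter=100):
    """Plain Lloyd iterations started from the given centroids."""
    centroids = centroids.copy()
    for _ in range(n_iter):
        # Assign every point to its nearest centroid.
        d = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)
        labels = d.argmin(axis=1)
        # Move each centroid to the mean of its points (keep it if empty).
        new = np.array([
            X[labels == j].mean(axis=0) if np.any(labels == j) else centroids[j]
            for j in range(len(centroids))
        ])
        if np.allclose(new, centroids):
            break
        centroids = new
    return labels, centroids


# Toy data: three tight, well-separated groups of 20 points each.
rng = np.random.default_rng(0)
centers = np.array([[0.0, 0.0], [10.0, 0.0], [0.0, 10.0]])
X = np.vstack([c + rng.uniform(-0.5, 0.5, size=(20, 2)) for c in centers])

cores = find_cores(X, k=5, min_sep=3.0)   # one core per group on this data
labels, centroids = kmeans(X, cores)      # k is len(cores), not user-supplied
print(len(cores), "clusters estimated")
```

Because the initial centroids are chosen deterministically from the data, repeated runs give the same partition, which is the stability property the abstract emphasizes. On overlapping clusters the fixed `min_sep` used here would likely be too crude; that harder case is what the paper's core-detection technique is meant to handle.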
