聚类分析
计算机科学
数据挖掘
模糊聚类
杠杆(统计)
度量(数据仓库)
集合(抽象数据类型)
约束聚类
人工智能
相似性度量
模糊逻辑
机器学习
CURE数据聚类算法
程序设计语言
作者
Tamunokuro Opubo William-West,Armand F. Donfack Kana,Musa Adeku Ibrahim
标识
DOI:10.1016/j.ins.2022.01.018
摘要
Shadowed set approximation form a cornerstone for the explainable decision advice provided by shadowed C means (SCM) clustering in unsupervised learning. Due to its advantage of managing uncertainty in fuzzy clustering, it has been used in data classification. Existing SCM clustering method requires the overall amount of uncertainty associated with a fuzzy cluster ck to be preserved in its boundary region. This requirement may suffer serious risk of having high number of unclassified patterns, especially when the uncertainty in ck is very high. Consequently, the ensuing clustering may not adequately maximize the inter-cluster separation necessary for achieving optimum cluster validity results. To tackle this problem, this paper considers new SCM clustering methods arising from (i) trade-off between uncertain and certain regions, which is necessary for refraining from making uncertain classification as much as possible, (ii) measure of sharpness balance, which helps to leverage on the location of a pattern from borderline and identify included or excluded patterns by means of their location from borderline, (iii) measure of gradualness balance, which exploits the degree of transition of a pattern out of or into ck. Each method comes with some advantages. For instance, the first and third methods may minimize the overall amount of unclassified patterns. To provide an overall evaluation of the performance of the proposed methods, a comparative study with some other shadowed set-based optimization methods are involved by considering some data sets from UCI Machine Learning repository. Friedman testing followed by Holm-Bonferroni testing are also exploited to provide statistical analysis on the performance significance of the compared methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI