Enhanced Fuzzy Clustering for Incomplete Instance with Evidence Combination
聚类分析
数据挖掘
模糊聚类
计算机科学
模糊逻辑
人工智能
数学
机器学习
作者
Zhe Liu,Sukumar Letchmunan
出处
期刊:ACM Transactions on Knowledge Discovery From Data [Association for Computing Machinery] 日期:2024-01-12卷期号:18 (3): 1-20被引量:22
标识
DOI:10.1145/3638061
摘要
Clustering incomplete instance is still a challenging task since missing values maybe make the cluster information ambiguous, leading to the uncertainty and imprecision in results. This article investigates an enhanced fuzzy clustering with evidence combination method based on Dempster-Shafer theory (DST) to address this problem. First, the dataset is divided into several subsets, and missing values are imputed by neighbors with different weights in each subset. It aims to model missing values locally to reduce the negative impact of the bad estimations. Second, an objective function of enhanced fuzzy clustering is designed and then optimized until the best membership and reliability matrices are found. Each subset has a membership matrix that contains all sub-instances’ membership to different clusters. The fuzzy reliability matrix is employed to characterize the reliability of each subset on different clusters. Third, an adaptive evidence combination rule based on the DST is developed to combine the discounted subresults (memberships) with different reliability to make the final decision for each instance. The proposed method can characterize uncertainty and imprecision by assigning instances to specific clusters or meta-clusters composed of several specific clusters. Once an instance is assigned to a meta-cluster, the cluster information of this instance is (locally) imprecise. The effectiveness of proposed method is demonstrated on several real-world datasets by comparing with existing techniques.