特征(语言学)
聚类分析
模糊逻辑
模式识别(心理学)
模糊聚类
数学
火焰团簇
人工智能
算法
基质(化学分析)
计算机科学
数据挖掘
CURE数据聚类算法
哲学
语言学
材料科学
复合材料
作者
Haiyan Yu,Lerong Jiang,Jiulun Fan,Shuang Xie,Rong Lan
标识
DOI:10.1016/j.eswa.2023.122270
摘要
The possibilistic fuzzy c-means clustering (PFCM) algorithm is a hybridization of possibilistic c-means clustering (PCM) and fuzzy c-means clustering (FCM) algorithms. However, there are two main problems in PFCM. One is that the Euclidean distance employed in PFCM always disregards the imbalance among sample features because it treats all features of data equally, which easily causes misclassification for feature-imbalanced multidimensional data. The other is that PFCM always produces significant center deviations and overlapping centers for multiclass datasets with strong noise injection, due to the difficulty of PFCM in the membership-weight parameter setting and the lack of between-class relationships in possibilistic memberships. Therefore, a feature-weighted suppressed possibilistic fuzzy c-means clustering (FW-S-PFCM) algorithm is presented by introducing a feature-weighted method and "suppressed competitive learning" strategy into the PFCM algorithm in this paper. First, the FW-S-PFCM algorithm introduces a feature-weight matrix into the objective function that can automatically assign feature-weight values to different features and different clusters according to the distribution of samples, thus overcoming the influence of feature imbalance and improving clustering effects for noisy multidimensional datasets. Second, combined with the feature-weight matrix, a "suppressed competitive learning" strategy is designed to resolve the center-overlapping problem in noisy multiclass dataset clustering. Specifically, partial crucial points of each class near the center are selected according to a cluster core generated by a cross-section of a threshold on the possibilistic membership surface. Third, their possibilistic memberships participate in the suppressed learning process to overcome the lack of between-class relationships. Last, a segmentation algorithm for noisy color images based on FW-S-PFCM is proposed combined with the feature-weight method and noise-identification ability of possibilistic memberships. Experiments on synthetic data, UCI data and color image segmentation demonstrate that the proposed FW-S-PFCM algorithm can overcome the partial center-overlapping problem and improve clustering performance on complex datasets with feature imbalance and strong noise injection. The proposed algorithm can also reduce the iteration number, sensitivity to membership weights, and initializations of PFCM.
科研通智能强力驱动
Strongly Powered by AbleSci AI