特征选择
粗集
数据挖掘
模糊集
模糊逻辑
稳健性(进化)
人工智能
模式识别(心理学)
计算机科学
特征(语言学)
隶属函数
机器学习
数学
生物化学
化学
语言学
哲学
基因
作者
Xiaoling Yang,Hongmei Chen,Hao Wang,Tianrui Li,Yu Zeng,Zhihong Wang,Chuan Luo
标识
DOI:10.1109/tfuzz.2022.3206508
摘要
Fuzzy rough set theory can model uncertainty in data and has been applied to feature selection for machine learning tasks. The existence of noise in data is one of the reasons for data uncertainty. However, most classical fuzzy rough set models are often sensitive to the noise in data, which somewhat degrades their applicability to process uncertainty of data. Furthermore, a robust feature evaluation function is nontrivial in a fuzzy rough set model as a nonoptimal feature subsets may be selected due to the perturbations from redundant features. In this article, we delve into local density and indispensable features for fuzzy rough feature selection to address these challenges. We first propose a local density-based fuzzy rough set (LDFRS) model to tackle noisy data. Mutual information is then plugged into the proposed LDFRS model to evaluate uncertainty in data. A joint feature evaluation function on the indispensability and relevance of features is constructed to evaluate the significance of features. On this basis, a fuzzy rough feature selection algorithm is built upon the LDFRS model. Experimental results using four typical classifiers demonstrate the robustness and effectiveness of the proposed model including our feature selection algorithm and its superiority against baseline methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI