数据挖掘
特征选择
熵(时间箭头)
模糊逻辑
计算机科学
人工智能
粗集
模糊集
机器学习
联合熵
模式识别(心理学)
数学
最大熵原理
数学优化
量子力学
物理
作者
Weihua Xu,Kehua Yuan,Wentao Li,Weiping Ding
出处
期刊:IEEE transactions on emerging topics in computational intelligence
[Institute of Electrical and Electronics Engineers]
日期:2022-05-27
卷期号:7 (1): 76-88
被引量:69
标识
DOI:10.1109/tetci.2022.3171784
摘要
Feature selection based on neighborhood rough set is a noteworthy step in dealing with numerical data. Information entropy, proven in many theoretical analysis and practical applications, is a compelling feature evaluation method for uncertainty information measures. Nonetheless, information entropy replaces probability with uncertainty measure to evaluate the average amount of information and ignores the decision distribution of data, especially in describing the uncertainty in imbalanced data. This paper discusses an emerging method for the feature selection in fuzzy data with imbalanced data by presenting a local composite entropy based on a neighborhood rough set. Based on the neighborhood rough set model, we discuss a similar relation to describe the relationship between different objects in unbalanced fuzzy data. In this process, to fully consider the distribution characteristics of unbalanced data, we construct a local composite entropy for handling the fuzzy decision systems with uncertainty and decision distribution, which is proven to be monotonic. Moreover, to improve the selection efficiency, a local heuristic forward greedy selection algorithm based on the local composite measure is designed to select the optimal feature subset. Finally, experimental results on twelve public datasets demonstrate that our method has better classification performance than some state-of-the-art feature selection methods in fuzzy data.
科研通智能强力驱动
Strongly Powered by AbleSci AI