特征选择
熵(时间箭头)
人工智能
模式识别(心理学)
特征(语言学)
模糊逻辑
相关性
数据挖掘
计算机科学
数学
联合熵
机器学习
最大熵原理
语言学
哲学
物理
几何学
量子力学
作者
Jianhua Dai,Qi Liu,Xiongtao Zou,Chucai Zhang
标识
DOI:10.1016/j.ins.2023.119753
摘要
Feature selection is a commonly employed method to decrease data processing complexity by discarding unnecessary and repetitive features. An effective feature selection method can mitigate the challenges posed by high-dimensional data, save computing resources and improve learning performance. Combination entropy is a useful tool for assessing feature uncertainty, which provides an intuitive representation of the amount of information. However, classical combination entropy is difficult to be directly used for continuous features. Therefore, we propose the concept of fuzzy combination entropy. Moreover, we put forward an importance metric that comprehensively considers global feature correlation and local feature correlation. Firstly, the fuzzy combination entropy (FCE) is presented based on the fuzzy λ-similarity relation. Secondly, by combining the benefits of fuzzy rough sets and combination entropy, fuzzy combination entropy and its variants are constructed, and their related properties are also discussed. Thirdly, the concepts of global feature correlation and local feature correlation are defined and an importance metric is proposed. Finally, a feature selection method according to fuzzy combination entropy considering global feature correlation and local feature correlation (FSmFCE) is designed. According to the findings from our experiments, it is evident that our algorithm demonstrates a preference for selecting a smaller feature set, yet still achieves commendable classification performance.
科研通智能强力驱动
Strongly Powered by AbleSci AI