理论(学习稳定性)
特征选择
集成学习
人工智能
机器学习
选择(遗传算法)
计算机科学
特征(语言学)
模式识别(心理学)
工程类
哲学
语言学
作者
Xin Feng,Yu-Long Zhao,Meng Zhang,Ying Zuo,Xiaofu Zou,Fei Tao
标识
DOI:10.1016/j.jmsy.2024.03.001
摘要
The uncertainty and complexity of real data collected in the industrial production process increase the difficulty in data-based knowledge discovering. Feature selection is an important step to remove redundant and irrelevant data, and thus it is essential to construct an efficient feature selection method. In this paper, an ensemble learning-driven stable feature selection method is proposed to improve the stability and accuracy of the feature selection. Firstly, datasets of different characteristics are generated to increase the diversity of data segments for feature selection. Secondly, two criteria (stability and prediction accuracy) are adopted to evaluate the performance weight of each feature selection algorithm, to ensure that the results of high-performance selectors have high priority in the algorithm aggregation process. Thirdly, the feature subsets are weighted and filtered based on expert experience to further ensure its stability. Finally, comparative experiments are conducted to show the effectiveness of the proposed method. Comparing with other methods, the proposed one can achieve the highest overall stability for feature selection (namely 0.936 measured by the Spearman rank correlation coefficient), and select the reasonable feature subset for data-driven prediction with the low mean absolute error (namely 0.315 as the average level).
科研通智能强力驱动
Strongly Powered by AbleSci AI