粒子群优化
特征选择
特征(语言学)
计算机科学
替代模型
选择(遗传算法)
人工智能
进化计算
多群优化
数学优化
模式识别(心理学)
样品(材料)
元启发式
高维
算法
数学
机器学习
物理
哲学
热力学
语言学
作者
Xianfang Song,Zhang Yon,Dunwei Gong,Hui Liu,Wanqiu Zhang
标识
DOI:10.1109/tevc.2022.3175226
摘要
With the increase of the number of features and the sample size, existing feature selection (FS) methods based on evolutionary optimization still face challenges such as the "curse of dimensionality" and the high computational cost. In view of this, dividing or clustering the sample and feature spaces at the same time, this article proposes a hybrid FS algorithm using surrogate sample-assisted particle swarm optimization (SS-PSO). First, a nonrepetitive uniform sampling strategy is employed to divide the whole sample set into several small-size sample subsets. Regarding each sample subset as a surrogate unit, next, a collaborative feature clustering mechanism is proposed to divide the feature space, with the purpose of reducing both the computational cost of clustering feature and the search space of PSO. Following that, an ensemble surrogate-assisted integer PSO is proposed. To ensure the prediction accuracy of ensemble surrogate when evaluating particles, an ensemble surrogate construction and management strategy is designed. Since the whole sample set is replaced by a small number of surrogate units, SS-PSO significantly reduces the cost of evaluating particles in PSO. Finally, the proposed algorithm is applied to some typical datasets, and compared with six typical evolutionary FS algorithms, as well as its several variant algorithms. The experimental results show that SS-PSO can obtain good feature subsets at the smallest computational cost on most of datasets. All verify that SS-PSO is a highly competitive method for high-dimensional FS.
科研通智能强力驱动
Strongly Powered by AbleSci AI