特征选择
计算机科学
维数之咒
人工智能
粒子群优化
冗余(工程)
特征(语言学)
机器学习
降维
最小冗余特征选择
数据挖掘
高维数据聚类
进化计算
模式识别(心理学)
特征向量
人口
聚类分析
操作系统
哲学
社会学
人口学
语言学
作者
Binh Tran,Bing Xue,Mengjie Zhang
标识
DOI:10.1145/3321707.3321713
摘要
Feature space is an important factor influencing the performance of any machine learning algorithm including classification methods. Feature selection aims to remove irrelevant and redundant features that may negatively affect the learning process especially on high-dimensional data, which usually suffers from the curse of dimensionality. Feature ranking is one of the most scalable feature selection approaches to high-dimensional problems, but most of them fail to automatically determine the number of selected features as well as detect redundancy between features. Particle swarm optimisation (PSO) is a population-based algorithm which has shown to be effective in addressing these limitations. However, its performance on high-dimensional data is still limited due to the large search space and high computation cost. This study proposes the first adaptive multi-swarm optimisation (AMSO) method for feature selection that can automatically select a feature subset of high-dimensional data more effectively and efficiently than the compared methods. The subswarms are automatically and dynamically changed based on their performance during the evolutionary process. Experiments on ten high-dimensional datasets of varying difficulties have shown that AMSO is more effective and more efficient than the compared PSO-based and traditional feature selection methods in most cases.
科研通智能强力驱动
Strongly Powered by AbleSci AI