特征选择
计算机科学
分类器(UML)
数据挖掘
降维
遗传程序设计
维数之咒
人工智能
特征(语言学)
滤波器(信号处理)
模式识别(心理学)
适应度函数
进化算法
人口
机器学习
遗传算法
哲学
语言学
计算机视觉
人口学
社会学
作者
Marwa Hammami,Slim Bechikh,Chih‐Cheng Hung,Lamjed Ben Saïd
标识
DOI:10.1109/cec.2018.8477771
摘要
Feature selection and construction are important pre-processing techniques in data mining. They may allow not only dimensionality reduction but also classifier accuracy and efficiency improvement. These two techniques are of great importance especially for the case of high-dimensional data. Feature construction for high-dimensional data is still a very challenging topic. This can be explained by the large search space of feature combinations, whose size is a function of the number of features. Recently, researchers have used Genetic Programming (GP) for feature construction and the obtained results were promising. Unfortunately, the wrapper evaluation of each feature subset, where a feature can be constructed by a combination of features, is computationally intensive since such evaluation requires running the classifier on the data sets. Motivated by this observation, we propose, in this paper, a hybrid multiobjective evolutionary approach for efficient feature construction and selection. Our approach uses two filter objectives and one wrapper objective corresponding to the accuracy. In fact, the whole population is evaluated using two filter objectives. However, only non-dominated (best) feature subsets are improved using an indicator-based local search that optimizes the three objectives simultaneously. Our approach has been assessed on six high-dimensional datasets and compared with two existing prominent GP approaches, using three different classifiers for accuracy evaluation. Based on the obtained results, our approach is shown to provide competitive and better results compared with two competitor GP algorithms tested in this study.
科研通智能强力驱动
Strongly Powered by AbleSci AI