特征选择
计算机科学
模式识别(心理学)
特征(语言学)
分类器(UML)
特征向量
人工智能
数据挖掘
理论(学习稳定性)
机器学习
哲学
语言学
作者
Mengmeng Li,Zhigang Shang,Caitong Yue
标识
DOI:10.1007/978-3-319-68759-9_47
摘要
To remove the irrelevant and redundant features from the high-dimensional data while ensuring classification accuracy, a supervised feature subset evaluation method based on multi-objective optimization has been proposed in this paper. Four aspects, sparsity of feature space, classification accuracy, information loss degree and feature subset stability, were took into account in the proposed method and the Multi-objective functions were constructed. Then the popular NSGA-II algorithm was used for optimization of the four objectives in the feature selection process. Finally the feature subset was selected based on the obtained feature weight vector according the four evaluation criteria. The proposed method was tested on 4 standard data sets using two kinds of classifier. The experiment results show that the proposed method can guarantee the higher classification accuracy even though only few numbers of features selected than the other methods. On the other hand, the information loss degrees of the proposed method are the lowest which demonstrates that the selected feature subsets of the proposed method can represent the original data sets best.
科研通智能强力驱动
Strongly Powered by AbleSci AI