计算机科学
主成分分析
特征选择
聚类分析
模式识别(心理学)
数据挖掘
人工智能
排名(信息检索)
特征(语言学)
选择(遗传算法)
特征提取
多元统计
机器学习
哲学
语言学
作者
Hyunjin Yoon,Kevin J Yang,Cyrus Shahabi
出处
期刊:IEEE Transactions on Knowledge and Data Engineering
[Institute of Electrical and Electronics Engineers]
日期:2005-08-03
卷期号:17 (9): 1186-1198
被引量:186
标识
DOI:10.1109/tkde.2005.144
摘要
Feature subset selection (FSS) is a known technique to preprocess the data before performing any data mining tasks, e.g., classification and clustering. FSS provides both cost-effective predictors and a better understanding of the underlying process that generated the data. We propose a family of novel unsupervised methods for feature subset selection from multivariate time series (MTS) based on common principal component analysis, termed CLeVer. Traditional FSS techniques, such as recursive feature elimination (RFE) and Fisher criterion (FC), have been applied to MTS data sets, e.g., brain computer interface (BCI) data sets. However, these techniques may lose the correlation information among features, while our proposed techniques utilize the properties of the principal component analysis to retain that information. In order to evaluate the effectiveness of our selected subset of features, we employ classification as the target data mining task. Our exhaustive experiments show that CLeVer outperforms RFE, FC, and random selection by up to a factor of two in terms of the classification accuracy, while taking up to 2 orders of magnitude less processing time than RFE and FC.
科研通智能强力驱动
Strongly Powered by AbleSci AI