A Comprehensive Review of Dimensionality Reduction Techniques for Feature Selection and Feature Extraction

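Keywords: Dimensionality reduction; Computer science; Feature selection; Curse of dimensionality; Artificial intelligence; Machine learning; Redundancy (engineering); Data mining; Feature extraction; Pattern recognition (psychology); Operating system
Authors
Rizgar R. Zebari, Adnan Mohsin Abdulazeez, Diyar Qader Zeebaree, Dilovan Assad Zebari, Jwan Najeeb Saeed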
Source
Journal: Journal of Applied Science and Technology Trends [Interdisciplinary Publishing Academia]
Volume (Issue): 1 (1): 56-70 Cited by: 726
Identifier
DOI:10.38094/jastt1224
Abstract

Because data dimensionality has increased sharply, data mining and machine learning (ML) tasks require more efficient techniques to achieve the desired results. In recent years, researchers have therefore proposed and developed many methods to reduce the high dimensionality of data while attaining the required accuracy. To improve the accuracy of feature learning and to decrease training time, dimensionality reduction is used as a pre-processing step that can eliminate irrelevant data, noise, and redundant features. Dimensionality reduction (DR) is performed using two main methods: feature selection (FS) and feature extraction (FE). FS is considered important because data is generated continuously at an ever-increasing rate; it can mitigate some serious dimensionality problems by effectively decreasing redundancy, eliminating irrelevant data, and improving the comprehensibility of results. FE, in turn, deals with the problem of finding the most distinctive, informative, and reduced set of features in order to improve the efficiency of both processing and storing data. This paper offers a comprehensive review of FS and FE in the scope of DR. The details of each reviewed paper, such as the algorithms/approaches used, datasets, classifiers, and achieved results, are comprehensively analyzed and summarized. In addition, all of the reviewed methods are systematically discussed to highlight authors' trends, to determine the method(s) that significantly reduced computational time, and to select the most accurate classifiers. As a result, the different types of both methods are discussed and the findings are analyzed.
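
The distinction the abstract draws between FS (keeping a subset of the original features) and FE (deriving new, more compact features) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the reviewed paper: the toy data, the helper names `select_features` and `first_principal_component`, the choice of a variance filter as the FS example, and the choice of a single principal component (found by power iteration) as the FE example.

```python
import math

def column_variances(rows):
    """Population variance of each column of a list-of-lists dataset."""
    n, d = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(d)]
    return [sum((r[j] - means[j]) ** 2 for r in rows) / n for j in range(d)]

def select_features(rows, k):
    """Feature selection (illustrative variance filter): keep the k original
    columns with the highest variance, leaving their values unchanged."""
    variances = column_variances(rows)
    ranked = sorted(range(len(variances)), key=lambda j: variances[j], reverse=True)
    keep = sorted(ranked[:k])
    return keep, [[r[j] for j in keep] for r in rows]

def first_principal_component(rows, iterations=200):
    """Feature extraction (illustrative PCA): build one new feature as a
    linear combination of all columns, using power iteration on the
    covariance matrix to approximate its dominant eigenvector."""
    n, d = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(d)]
    centered = [[r[j] - means[j] for j in range(d)] for r in rows]
    cov = [[sum(c[a] * c[b] for c in centered) / n for b in range(d)]
           for a in range(d)]
    v = [1.0] * d
    for _ in range(iterations):
        w = [sum(cov[a][b] * v[b] for b in range(d)) for a in range(d)]
        norm = math.sqrt(sum(x * x for x in w))
        v = [x / norm for x in w]
    scores = [sum(c[j] * v[j] for j in range(d)) for c in centered]
    return v, scores

# Hypothetical toy data: column 1 is roughly twice column 0 (redundant),
# column 2 is constant (irrelevant).
rows = [
    [1.0, 2.1, 5.0],
    [2.0, 3.9, 5.0],
    [3.0, 6.2, 5.0],
    [4.0, 7.8, 5.0],
    [5.0, 10.1, 5.0],
]

keep, reduced = select_features(rows, k=2)       # drops the constant column
direction, scores = first_principal_component(rows)  # one derived feature
print("kept columns:", keep)
print("first component direction:", [round(x, 3) for x in direction])
```

In this sketch the variance filter discards only the irrelevant constant column and cannot merge the two correlated columns, whereas the extracted component summarizes both in a single new feature at the cost of no longer corresponding to any one original column.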
