班级(哲学)
计算机科学
样品(材料)
数据集
样本量测定
特征选择
集合(抽象数据类型)
数据挖掘
过程(计算)
特征(语言学)
选择(遗传算法)
数据分类
人工智能
机器学习
数学
统计
语言学
化学
哲学
色谱法
程序设计语言
操作系统
作者
Abhishek Yogi,Ratul Dey
出处
期刊:International research journal of computer science
[AM Publications]
日期:2022-04-30
卷期号:9 (4): 56-60
被引量:2
标识
DOI:10.26562/irjcs.2021.v0904.002
摘要
In last few years there are many changes and evolution has been done on the classification of data. Application area of technology increases then the size of data also increases. Classification of data becomes difficult because of imbalance nature and unbounded size of data. Class imbalance problem becomes the greatest issue in the data mining. Imbalance problem occurs where one of the two classes having more sample than the other classes. The most of the algorithms are focusing on the classification of major sample while ignoring minority sample. The minority samples are those samples that rarely occur but are very important. There are different methods available for the classification of imbalance data set which is divided into three categories i.e. the algorithmic approach, feature selection approach and the data processing approach. These approaches have their own advantages and disadvantages. In this paper a systematic study of each process is defined which gives the right direction for research in class imbalance problem.
科研通智能强力驱动
Strongly Powered by AbleSci AI