计算机科学
加入
回归
背景(考古学)
人工智能
集合(抽象数据类型)
机器学习
数据挖掘
模式识别(心理学)
数学
统计
生物
古生物学
程序设计语言
作者
Luís Camacho,Georgios Douzas,Fernando Bação
标识
DOI:10.1016/j.eswa.2021.116387
摘要
Learning from imbalanced data sets is known to be a challenging task. There are many proposals to tackle the challenge for classification problems, but regarding regression the solutions are few. In the context of regression, imbalanced learning means that there is a concern with the accurate prediction of the target values in a subset of the continuous target variable, considering that these values rarely occur in the data set. In this article, we extend the G-SMOTE algorithm that is used in classification to regression tasks. G-SMOTE is a pre-processing algorithm that differs from the SMOTE algorithm as it allows the generation of synthetic instances in a geometric region around the selected instances rather than in the line segment that joins the two selected instances. The performance of G-SMOTE for regression was compared against other methods, and the empirical results show that our proposal outperformed those methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI