计算机科学
边距(机器学习)
对抗制
人工智能
机器学习
采样(信号处理)
通知
生成语法
合成数据
模式识别(心理学)
数据挖掘
计算机视觉
政治学
滤波器(信号处理)
法学
作者
Ke Wang,Tongqing Zhou,Menghua Luo,Xionglue Li,Zhiping Cai
标识
DOI:10.1016/j.eswa.2023.121696
摘要
The imbalanced data problem is widely recognized in real-world datasets. To avoid learning bias on imbalanced data, the over-sampling strategy is well studied and adopted for generating synthetic minority samples. Wherein, the Synthetic Minority Over-sampling Technology and its improved algorithms become standard baselines. In recent years, the popular Generative Adversarial Networks and its enhanced variants, introduced from the computer vision community, are believed to generate better samples, by approximating the true data distribution. Yet, we notice that the synthetic samples for the minority category in these existing methods is usually restrained in a limited samples space known in advance, which may mislead the classifiers trained on them to take data in the unsampled region of the minority category as from the majority category. Given such limitations, we propose a Generative Adversarial Minority Enlargement (GAME) method to intentionally extend the sampling margin during data generative and adversarial phases. This is accomplished by adjusting the parameters of a local linear model to approach the majority category. We conduct an extensive evaluation on 28 datasets of different domains, extracted from the UCI real-world datasets. The results show that GAME can achieve more balanced and stable results efficiently than 18 state-of-the-art methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI