鉴别器
计算机科学
过采样
水准点(测量)
残余物
人工智能
模式识别(心理学)
约束(计算机辅助设计)
采样(信号处理)
算法
数据挖掘
机器学习
探测器
数学
计算机网络
电信
几何学
大地测量学
带宽(计算)
地理
作者
Tingting Pan,Witold Pedrycz,Jie Yang,Jian Wang
标识
DOI:10.1016/j.engappai.2024.107934
摘要
Many oversampling methods applied to imbalanced data generate samples according to local density distribution of minority samples. However, samples generated by these methods can only present a non-deterministic relationship between the local and global distributions. A generative adversarial network (GAN) is a suitable tool to learn an unknown global probability distribution. In this paper, we propose an improved GAN (I-GAN) to oversample according to the global underlying structure of minority samples. The originality of I-GAN stems from the fact it provides additional density distribution information of minority samples for GAN and generated samples. By building on this idea, three detailed strategies are presented: input random vectors of the generator are sampled from a rough estimate of the distribution of minority samples to orientate fake samples more believable; a residual about minority samples is added into the discriminator to strengthen the constraint of loss function; generated samples are redistributed with a reshaper. These three strategies provide innovative methodologies at various stages of GANs for the oversampling task. Compared with 22 classical and popular imbalanced sampling methods under metrics of Gm, F1, and AUC on 24 benchmark imbalanced datasets, it is shown that I-GAN is effective and robust. The I-GAN implementation line procedure has been uploaded to Github (https://github.com/flowerbloom000/I-GAN).
科研通智能强力驱动
Strongly Powered by AbleSci AI