计算机科学
机器学习
人工智能
采样(信号处理)
自适应采样
数学
统计
蒙特卡罗方法
计算机视觉
滤波器(信号处理)
作者
Haibo He,Yang Bai,Edwardo A. Garcia,Shutao Li
出处
期刊:International Joint Conference on Neural Network
日期:2008-06-01
卷期号:: 1322-1328
被引量:2844
标识
DOI:10.1109/ijcnn.2008.4633969
摘要
This paper presents a novel adaptive synthetic (ADASYN) sampling approach for learning from imbalanced data sets. The essential idea of ADASYN is to use a weighted distribution for different minority class examples according to their level of difficulty in learning, where more synthetic data is generated for minority class examples that are harder to learn compared to those minority examples that are easier to learn. As a result, the ADASYN approach improves learning with respect to the data distributions in two ways: (1) reducing the bias introduced by the class imbalance, and (2) adaptively shifting the classification decision boundary toward the difficult examples. Simulation analyses on several machine learning data sets show the effectiveness of this method across five evaluation metrics.
科研通智能强力驱动
Strongly Powered by AbleSci AI