随机森林
人工智能
计算机科学
班级(哲学)
集成学习
决策树
机器学习
采样(信号处理)
模式识别(心理学)
统计分类
一级分类
数据挖掘
上下文图像分类
过采样
支持向量机
图像(数学)
滤波器(信号处理)
计算机视觉
带宽(计算)
计算机网络
作者
May Phu Paing,Chuchart Pintavirooj,S. Tungjitkusolmun,Somsak Choomchuay,Kazuhiko Hamamoto
标识
DOI:10.1109/bmeicon.2018.8609946
摘要
Imbalanced data classification is a serious and challenging task for most of the medical image diagnosis applications. They usually produce a larger number of false samples compared to the actual ones. That is the number of samples for the class of interest (minority) is significantly fewer than other types of class (majority). The classification performed using such data is called imbalanced data classification. As a consequence, the learning model bias towards the majority class and fails the classification of the minority class. Data sampling and ensemble methods are common ways to compensate for this issue. Random forest (RF), an ensemble of multiple decision trees, is very famous in both of the classification and regression problems because of its robust and accurate predictions. However, it also suffers class bias in the imbalanced data classification problems. This paper proposes and compares different sampling methods to solve the imbalanced data classification in RF.
科研通智能强力驱动
Strongly Powered by AbleSci AI