计算机科学
机器学习
多数决原则
稳健性(进化)
集合预报
人工智能
集成学习
噪音(视频)
数据挖掘
原始数据
图像(数学)
生物化学
化学
基因
程序设计语言
作者
Jianrong Yao,Zhongyi Wang,Lu Wang,Meng Liu,Hui Jiang,Yuangao Chen
标识
DOI:10.1016/j.eswa.2022.116913
摘要
Credit scoring is used to help financial institutions control default risks and reduce economic losses, and a variety of mainstream machine learning and data mining algorithms have been applied for this purpose. However, real-world datasets are generally noisy, which seriously affects the performance of credit scoring models. Among the mainstream strategies for handling noise, instance filtering may result in information loss, especially for hard-to-access credit datasets, and label noise correction may produce erroneous information in the dataset. In this study, to reduce the adverse impact of noisy data on the performance of classification algorithms, a novel hybrid ensemble credit scoring model with stacking-based noise detection and weight assignment is developed to remove or adapt noisy data in raw datasets and to form noise-detected training data to obtain excellent default risk prediction competence. Furthermore, a new weight assignment approach based on a cloud model is proposed, which is applied to calculate the weight values of the classifiers in the weighted voting ensemble model to improve the prediction accuracy of the proposed model. In this study, five public datasets are adopted using five performance metrics to evaluate the performance of the proposed model. The experimental results demonstrate good model prediction power and robustness.
科研通智能强力驱动
Strongly Powered by AbleSci AI