LiFoL: An Efficient Framework for Financial Distress Prediction in High-Dimensional Unbalanced Scenario

可解释性 增采样 计算机科学 特征(语言学) 人工智能 维数之咒 机器学习 班级(哲学) 数据挖掘 语言学 图像(数学) 哲学
作者
Jianyong Wang,Xiaojun Kuang,Jifeng Guo
出处
期刊:IEEE Transactions on Computational Social Systems [Institute of Electrical and Electronics Engineers]
卷期号:: 1-12
标识
DOI:10.1109/tcss.2023.3276059
摘要

Corporate financial distress will significantly damage the company’s and its stakeholders’ interests and even lead to a global financial crisis. Therefore, finding an efficient method for financial distress prediction (FDP) to avoid greater losses is essential. Although there is a lot of research and progress in this field, the existing methods rarely consider the problems of high dimensionality and class imbalance, which will largely limit the models to achieve satisfactory performance. To alleviate these problems, this article first proposes a novel Lightspace-SMOTE upsampling method, which can reduce the feature dimensionality and increase the signal-to-noise ratio (SNR) of the original data and then upsample it to increase the number of minor class samples. In addition, this article proposes an efficient ensemble framework (LiFoL) that combines Lightspace-SMOTE, focal loss (FL), and LightGBM, which can not only focus more on minor class and the hard-to-class samples but also obtain better performance. At the same time, the feature importance provided by the model can provide strong support for model interpretability. Experimental results show that the Lightspace-SMOTE upsampling method can help the model achieve higher scores in area under ROC curve (AUC) and recall, especially in the case of longer prediction periods. Compared with current methods, LiFoL can achieve more than 10% improvement in AUC and more than 20% in recall.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
萧然完成签到,获得积分10
1秒前
4秒前
wuda完成签到,获得积分10
5秒前
小邓完成签到,获得积分10
5秒前
Kingzd完成签到,获得积分10
6秒前
伊比利亚黑毛猪黑松露芝士火腿完成签到,获得积分10
6秒前
蝈蝈完成签到,获得积分10
7秒前
bcsunny2022完成签到,获得积分10
7秒前
小绵羊完成签到,获得积分20
8秒前
Ou完成签到,获得积分10
8秒前
pick_up完成签到,获得积分10
10秒前
xiaowang发布了新的文献求助10
11秒前
英俊的铭应助Sandy采纳,获得10
11秒前
gogogo完成签到,获得积分10
11秒前
科研通AI2S应助小绵羊采纳,获得10
12秒前
张欢馨应助小绵羊采纳,获得10
12秒前
Owen应助小绵羊采纳,获得10
12秒前
lulu完成签到 ,获得积分10
12秒前
xue112完成签到 ,获得积分0
13秒前
zyx完成签到,获得积分10
14秒前
飞草发布了新的文献求助10
15秒前
清脆的绝悟完成签到,获得积分10
15秒前
黑咖啡完成签到,获得积分10
16秒前
怕黑的砖家完成签到 ,获得积分10
17秒前
诸葛烤鸭完成签到,获得积分10
19秒前
19秒前
SGLY完成签到,获得积分10
20秒前
123完成签到,获得积分10
21秒前
缥缈的雁枫完成签到,获得积分10
21秒前
后陡门小学生完成签到 ,获得积分10
21秒前
21秒前
韶邑完成签到,获得积分10
22秒前
宇是眼中星眸完成签到 ,获得积分10
22秒前
lifuyi291完成签到,获得积分10
23秒前
小木子完成签到,获得积分10
25秒前
25秒前
憨憨且老刘完成签到,获得积分10
28秒前
29秒前
大椒完成签到 ,获得积分10
30秒前
awen完成签到,获得积分10
31秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
PowerCascade: A Synthetic Dataset for Cascading Failure Analysis in Power Systems 2000
Picture this! Including first nations fiction picture books in school library collections 1500
Signals, Systems, and Signal Processing 610
Unlocking Chemical Thinking: Reimagining Chemistry Teaching and Learning 555
CLSI M100 Performance Standards for Antimicrobial Susceptibility Testing 36th edition 400
Cancer Targets: Novel Therapies and Emerging Research Directions (Part 1) 400
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6362286
求助须知:如何正确求助?哪些是违规求助? 8176007
关于积分的说明 17224813
捐赠科研通 5416998
什么是DOI,文献DOI怎么找? 2866674
邀请新用户注册赠送积分活动 1843775
关于科研通互助平台的介绍 1691614