可解释性
计算机科学
水准点(测量)
嵌入
机器学习
人工智能
特征(语言学)
决策树
树(集合论)
数据挖掘
数学
语言学
数学分析
哲学
大地测量学
地理
作者
Ying Gao,Huining Xiao,Choujun Zhan,Liang Li,Wentian Cai,Xiping Hu
标识
DOI:10.1016/j.ins.2023.119447
摘要
Credit transactions are vital financial activities that yield substantial economic benefits. To further improve lending decisions, stakeholders require accurate and interpretable credit scoring methods. While the majority of previous studies have focused on the relationship between individual features and credit risk, only a few have investigated cross-features. Notably, cross-features can not only represent structured data effectively but also provide richer semantic information than individual features. Nevertheless, most previous methods for learning cross-feature effects from credit data have been implicit and unexplainable. This paper proposes a new credit scoring model based on contrastive augmentation and tree-enhanced embedding mechanisms, termed CATE. The proposed model automatically constructs explainable cross-features by using tree-based models to learn decision rules from the data. Moreover, the importance of each local cross-feature is then derived through an attention mechanism. Finally, the credit score of a user is evaluated using embedding vectors. Experimental results on 4 public datasets demonstrated the interpretability of our proposed method and outperformed 13 state-of-the-art benchmark methods in terms of performance.
科研通智能强力驱动
Strongly Powered by AbleSci AI