Healthcare insurance fraud detection using data mining

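Health informatics, Healthcare, Data science, Computer science, Data mining, Business, Economics, Economic growth
Authors
Zain Hamid, Fatima Khalique, Saba Mahmood, Ali Daud, Amal Bukhari, Bader Alshemaimri
Source
Journal: BMC Medical Informatics and Decision Making [BioMed Central]
Volume (Issue): 24 (1) Citations: 3
Identifier
DOI: 10.1186/s12911-024-02512-4
Abstract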

Background: Healthcare programs and insurance initiatives play a crucial role in ensuring that people have access to medical care. Healthcare insurance programs have many benefits, but fraud remains a significant challenge for the insurance industry. Fraud detection is difficult because fraud schemes are sophisticated and evolve in response to detection methods. Analysis of large healthcare datasets is hindered by complexity, data quality issues, and the need for real-time detection, while privacy concerns and false positives pose additional hurdles. The lack of standardization in coding and limited resources further complicate efforts to address fraudulent activity effectively.

Methodology: This study presents a fraud detection methodology that uses association rule mining augmented with unsupervised learning techniques to detect healthcare insurance fraud. The analysis uses the Centers for Medicare and Medicaid Services (CMS) 2008-2010 DE-SynPUF dataset. The proposed methodology works in two stages. First, association rule mining extracts frequent rules from the transactions based on patient, service, and service provider features. Second, the extracted rules are passed to unsupervised classifiers, namely isolation forest (IF), cluster-based local outlier factor (CBLOF), empirical cumulative distribution-based outlier detection (ECOD), and one-class support vector machine (OCSVM), to identify fraudulent activity.

Results: Descriptive analysis shows patterns and trends in the data, revealing relationships among diagnosis codes, procedure codes, and physicians. The baseline anomaly detection algorithms generated results in 902.24 seconds. A second experiment, which retrieved frequent rules using association rule mining with the Apriori algorithm and combined them with the unsupervised techniques, took 868.18 seconds. Silhouette scores were used to compare the four anomaly detection techniques: CBLOF scored highest at 0.114, followed by isolation forest at 0.103, while ECOD and OCSVM scored lower at 0.063 and 0.060, respectively.

Conclusion: The proposed methodology enhances healthcare insurance fraud detection by using association rule mining for pattern discovery and unsupervised classifiers for effective anomaly detection.
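
The first stage described in the abstract, extracting frequent rules with the Apriori algorithm, can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation: the toy transactions, the item labels (`dx:`, `proc:`, `prov:`), the function names `apriori` and `rules`, and the thresholds are all hypothetical, chosen only to show how support and confidence are computed. The second stage (passing rules to the anomaly detectors) is not shown, because the abstract does not specify how rules are encoded for those models.

```python
from itertools import combinations


def apriori(transactions, min_support):
    """Return {itemset: support} for every itemset with support >= min_support."""
    n = len(transactions)
    tsets = [frozenset(t) for t in transactions]
    # Level-1 candidates: every distinct single item.
    current = {frozenset([item]) for t in tsets for item in t}
    frequent = {}
    k = 1
    while current:
        # Keep the candidates that meet the support threshold at this level.
        level = {}
        for cand in current:
            support = sum(1 for t in tsets if cand <= t) / n
            if support >= min_support:
                level[cand] = support
        frequent.update(level)
        # Join step: unions of frequent k-itemsets that contain k+1 items.
        k += 1
        joined = {a | b for a in level for b in level if len(a | b) == k}
        # Prune step: every (k-1)-subset of a candidate must itself be frequent.
        current = {
            c for c in joined
            if all(frozenset(s) in level for s in combinations(c, k - 1))
        }
    return frequent


def rules(frequent, min_confidence):
    """Return a list of (antecedent, consequent, support, confidence) tuples."""
    out = []
    for itemset, support in frequent.items():
        for r in range(1, len(itemset)):
            for ante in combinations(sorted(itemset), r):
                ante = frozenset(ante)
                # confidence(A -> B) = support(A and B) / support(A)
                confidence = support / frequent[ante]
                if confidence >= min_confidence:
                    out.append((ante, itemset - ante, support, confidence))
    return out


# Hypothetical claims: one diagnosis, one procedure, one provider per claim.
transactions = [
    ["dx:D1", "proc:P1", "prov:A"],
    ["dx:D1", "proc:P1", "prov:A"],
    ["dx:D1", "proc:P1", "prov:B"],
    ["dx:D2", "proc:P2", "prov:B"],
    ["dx:D2", "proc:P1", "prov:A"],
]

frequent = apriori(transactions, min_support=0.4)
found = rules(frequent, min_confidence=0.9)
for ante, cons, support, confidence in found:
    print(sorted(ante), "->", sorted(cons), round(support, 2), round(confidence, 2))
```

In this toy data, diagnosis `D1` always co-occurs with procedure `P1`, so the rule `dx:D1 -> proc:P1` has confidence 1.0. A claim that pairs `D1` with a different procedure would break a high-confidence rule, which is the kind of signal a downstream anomaly detector could use.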
