潜在Dirichlet分配
主题模型
计算机科学
聚类分析
领域(数学)
数据科学
等级制度
文献计量学
系统回顾
财务报表
管理科学
人工智能
会计
数据挖掘
政治学
业务
工程类
数学
审计
梅德林
纯数学
法学
作者
Milad Soltani,Alexios Kythreotis,Arash Roshanpoor
出处
期刊:Journal of Financial Crime
[Emerald (MCB UP)]
日期:2023-04-19
卷期号:30 (5): 1367-1388
被引量:15
标识
DOI:10.1108/jfc-09-2022-0227
摘要
Purpose The emergence of machine learning has opened a new way for researchers. It allows them to supplement the traditional manual methods for conducting a literature review and turning it into smart literature. This study aims to present a framework for incorporating machine learning into financial statement fraud (FSF) literature analysis. This framework facilitates the analysis of a large amount of literature to show the trend of the field and identify the most productive authors, journals and potential areas for future research. Design/methodology/approach In this study, a framework was introduced that merges bibliometric analysis techniques such as word frequency, co-word analysis and coauthorship analysis with the Latent Dirichlet Allocation topic modeling approach. This framework was used to uncover subtopics from 20 years of financial fraud research articles. Furthermore, the hierarchical clustering method was used on selected subtopics to demonstrate the primary contexts in the literature on FSF. Findings This study has contributed to the literature in two ways. First, this study has determined the top journals, articles, countries and keywords based on various bibliometric metrics. Second, using topic modeling and then hierarchy clustering, this study demonstrates the four primary contexts in FSF detection. Research limitations/implications In this study, the authors tried to comprehensively view the studies related to financial fraud conducted over two decades. However, this research has limitations that can be an opportunity for future researchers. The first limitation is due to language bias. This study has focused on English language articles, so it is suggested that other researchers consider other languages as well. The second limitation is caused by citation bias. In this study, the authors tried to show the top articles based on the citation criteria. However, judging based on citation alone can be misleading. Therefore, this study suggests that the researchers consider other measures to check the citation quality and assess the studies’ precision by applying meta-analysis. Originality/value Despite the popularity of bibliometric analysis and topic modeling, there have been limited efforts to use machine learning for literature review. This novel approach of using hierarchical clustering on topic modeling results enable us to uncover four primary contexts. Furthermore, this method allowed us to show the keywords of each context and highlight significant articles within each context.
科研通智能强力驱动
Strongly Powered by AbleSci AI