潜在Dirichlet分配
相似性(几何)
北京
欧几里德距离
政府(语言学)
主题模型
中国
价值(数学)
经济
计算机科学
政治学
语言学
人工智能
哲学
法学
机器学习
图像(数学)
作者
Junhuan Zhang,Wanbing Gui,Jiaqi Wen
标识
DOI:10.1177/01655515221097858
摘要
This article proposes a combination model, which is composed of latent Dirichlet allocation model, TF-IDF feature extraction algorithm and Euclidean distance measurement method, to identify and judge whether the similarities between multiple policy texts exist or not. With the help of actual data result, this will drive the relevant government agencies to figure out problems in a timely manner and provide a decision-making basis for them to formulate and optimise appropriate economic policies. To this end, this article analyses and studies the four types of economic texts that are classified as Insurance, Banking, Tax and Finance from the Central Government of Hebei province and Shijiazhuang city levels. Also, we consider Beijing, Shanghai and Guangdong. Experimental results show that (1) the combination model can quickly and effectively recognise and determine whether there are similarities between multiple economic policy texts; (2) similarities exist or not between the central, provincial and municipal level policy texts depending on the comparison of the distance values across them; (3) the smaller the distance value between economic policy texts of the same kind, the higher the similarity in them; and (4) the distance values between the six policy texts in Finance, Insurance, Bank and Tax categories are ranked from low to high. In terms of similarity, the Finance category is the highest, followed by Insurance and Bank, and the Tax category is the lowest.
科研通智能强力驱动
Strongly Powered by AbleSci AI