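Big data
Statistical analysis
Computer science
Data collection
Data science
Process (computing)
World Wide Web
Information retrieval
Data mining
Statistics
Mathematics
Operating system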
Source
Journal: Wiadomości Statystyczne
Date: 2019-09-30
Volume (issue): 64 (9): 60-74
Citations: 1
Identifier
DOI: 10.5604/01.3001.0013.7590
Abstract
The goal of this paper is to present, on the one hand, the benefits for official statistics (labour market) resulting from the use of web-scraping methods to gather data on job advertisements from websites that belong to big data collections, and on the other, the challenges connected to this process. The paper presents the results of experimental research in which web-scraping and text-mining methods were adopted. The analysis was based on data from 2017–2018 obtained from the most popular job-searching websites, which were then collated with Statistics Poland’s data obtained from Z-05 forms. The analysis demonstrated that web-scraping methods can be adopted by public statistics services to obtain statistical data from alternative sources that complement existing databases, provided that the findings of such research remain coherent with the results of existing studies.