计算机科学
依赖关系(UML)
鉴定(生物学)
可解释性
人气
人工智能
独创性
自然语言处理
词汇
名词
信息抽取
数据科学
计算语言学
2019年冠状病毒病(COVID-19)
机器学习
情报检索
语言学
社会科学
定性研究
社会学
病理
哲学
生物
社会心理学
传染病(医学专业)
医学
植物
疾病
心理学
出处
期刊:Library Hi Tech
[Emerald (MCB UP)]
日期:2021-08-23
卷期号:40 (2): 495-515
被引量:16
标识
DOI:10.1108/lht-01-2021-0051
摘要
Purpose Previous research concerning automatic extraction of research topics mostly used rule-based or topic modeling methods, which were challenged due to the limited rules, the interpretability issue and the heavy dependence on human judgment. This study aims to address these issues with the proposal of a new method that integrates machine learning models with linguistic features for the identification of research topics. Design/methodology/approach First, dependency relations were used to extract noun phrases from research article texts. Second, the extracted noun phrases were classified into topics and non-topics via machine learning models and linguistic and bibliometric features. Lastly, a trend analysis was performed to identify hot research topics, i.e. topics with increasing popularity. Findings The new method was experimented on a large dataset of COVID-19 research articles and achieved satisfactory results in terms of f -measures, accuracy and AUC values. Hot topics of COVID-19 research were also detected based on the classification results. Originality/value This study demonstrates that information retrieval methods can help researchers gain a better understanding of the latest trends in both COVID-19 and other research areas. The findings are significant to both researchers and policymakers.
科研通智能强力驱动
Strongly Powered by AbleSci AI