计算机科学
概率逻辑
生成语法
词(群论)
跟踪(教育)
生成模型
GSM演进的增强数据速率
人工智能
数据挖掘
数据科学
自然语言处理
机器学习
语言学
社会学
教育学
哲学
作者
Wanying Ding,Chaomei Chen
摘要
Cocitation and co‐word methods have long been used to detect and track emerging topics in scientific literature, but both have weaknesses. Recently, while many researchers have adopted generative probabilistic models for topic detection and tracking, few have compared generative probabilistic models with traditional cocitation and co‐word methods in terms of their overall performance. In this article, we compare the performance of hierarchical D irichlet process ( HDP ), a promising generative probabilistic model, with that of the 2 traditional topic detecting and tracking methods—cocitation analysis and co‐word analysis. We visualize and explore the relationships between topics identified by the 3 methods in hierarchical edge bundling graphs and time flow graphs. Our result shows that HDP is more sensitive and reliable than the other 2 methods in both detecting and tracking emerging topics. Furthermore, we demonstrate the important topics and topic evolution trends in the literature of terrorism research with the HDP method.
科研通智能强力驱动
Strongly Powered by AbleSci AI