计算机科学
推定
主题模型
阅读(过程)
数据科学
人工智能
自然语言处理
语言学
哲学
政治学
法学
作者
Bolun Zhang,Yimang Zhou,Dai Li
标识
DOI:10.1177/00811750241265336
摘要
Validation is at the heart of methodological discussions about topic modeling. The authors argue that validation based on human reading hinges on distinctive words and readers’ labeling of a topic, and it overlooks the probability of conflicting results from semantically similar models, such as regressions or other methods. This runs counter to the presumption that topic modeling can reveal features of documents that have some measurable association with social aspects outside the text. The authors develop a similar topic identifying procedure to verify that semantically similar solutions yield similar results in further analysis. The authors argue that future validations of topic modeling must consider such procedures.
科研通智能强力驱动
Strongly Powered by AbleSci AI