n-gram
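Computer science
Class (philosophy)
Natural language processing
Word (group theory)
Artificial intelligence
Natural language
Gram
Sample (material)
Word lists by frequency
Linguistics
Language model
Philosophy
Chemistry
Biology
Genetics
Bacteria
Chromatography
Sentence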
Authors
Peter F. Brown, P. V. deSouza, Robert L. Mercer, Vincent J. Della Pietra, Jenifer C. Lai
Identifier
DOI:10.5555/176313.176316
Abstract
We address the problem of predicting a word from previous words in a sample of text. In particular, we discuss n-gram models based on classes of words. We also discuss several statistical algorithms for assigning words to classes based on the frequency of their co-occurrence with other words. We find that we are able to extract classes that have the flavor of either syntactically based groupings or semantically based groupings, depending on the nature of the underlying statistics.
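To make the idea of an n-gram model based on word classes concrete, here is a minimal Python sketch of a class-based bigram model. It assumes the usual factorization P(w_i | w_{i-1}) = P(w_i | c_i) * P(c_i | c_{i-1}), where c_i is the class of word w_i, and it estimates both factors by simple relative frequencies without smoothing. The toy corpus, the hand-written `word_to_class` mapping, and the function name `train_class_bigram` are illustrative assumptions; the abstract describes classes being assigned by statistical algorithms from co-occurrence frequencies, which this sketch does not attempt.

```python
from collections import Counter


def train_class_bigram(tokens, word_to_class):
    """Return prob(word, prev_word) for a class-based bigram model.

    Assumption: every word belongs to exactly one class, given by the
    hand-written mapping word_to_class (hypothetical, not learned).
    """
    classes = [word_to_class[w] for w in tokens]
    word_counts = Counter(tokens)
    class_counts = Counter(classes)
    # Counts of adjacent class pairs (c_prev, c).
    class_bigrams = Counter(zip(classes, classes[1:]))
    # How often each class occurs as the first element of a pair.
    context_counts = Counter(classes[:-1])

    def prob(word, prev_word):
        c = word_to_class[word]
        c_prev = word_to_class[prev_word]
        if context_counts[c_prev] == 0:
            return 0.0  # class never seen as a context; no smoothing here
        p_word_given_class = word_counts[word] / class_counts[c]
        p_class_given_prev = class_bigrams[(c_prev, c)] / context_counts[c_prev]
        return p_word_given_class * p_class_given_prev

    return prob


# Toy corpus and hand-assigned classes (both are illustrative assumptions).
tokens = "the cat sat on the mat the dog sat on the rug".split()
word_to_class = {
    "the": "DET",
    "cat": "NOUN", "mat": "NOUN", "dog": "NOUN", "rug": "NOUN",
    "sat": "VERB",
    "on": "PREP",
}
prob = train_class_bigram(tokens, word_to_class)

print(prob("dog", "the"))  # probability of "dog" following "the"
print(prob("sat", "mat"))  # "mat sat" never occurs in the corpus
```

The second call illustrates one reason to group words into classes: the word pair "mat sat" never appears in the toy corpus, yet it receives a nonzero probability because other words of the same class as "mat" are followed by a word of the same class as "sat".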