词汇化
重新使用
词典
计算机科学
词(群论)
意义(存在)
过程(计算)
钥匙(锁)
自然语言处理
扩展(谓词逻辑)
语言学
人工智能
心理学
工程类
程序设计语言
计算机安全
心理治疗师
哲学
废物管理
作者
Aotao Xu,Charles Kemp,Lea Frermann,Yang Xu
标识
DOI:10.1073/pnas.2406971121
摘要
A key function of the lexicon is to express novel concepts as they emerge over time through a process known as lexicalization. The most common lexicalization strategies are the reuse and combination of existing words, but they have typically been studied separately in the areas of word meaning extension and word formation. Here, we offer an information-theoretic account of how both strategies are constrained by a fundamental tradeoff between competing communicative pressures: Word reuse tends to preserve the average length of word forms at the cost of less precision, while word combination tends to produce more informative words at the expense of greater word length. We test our proposal against a large dataset of reuse items and compounds that appeared in English, French, and Finnish over the past century. We find that these historically emerging items achieve higher levels of communicative efficiency than hypothetical ways of constructing the lexicon, and both literal reuse items and compounds tend to be more efficient than their nonliteral counterparts. These results suggest that reuse and combination are both consistent with a unified account of lexicalization grounded in the theory of efficient communication.
科研通智能强力驱动
Strongly Powered by AbleSci AI