多重共线性
推论
变化(天文学)
复数
因果推理
随机森林
计量经济学
动词
口译(哲学)
适度
统计
变量(数学)
计算机科学
语言学
自然语言处理
人工智能
数学
机器学习
线性回归
哲学
数学分析
物理
程序设计语言
天体物理学
作者
Sali A. Tagliamonte,R. Harald Baayen
出处
期刊:Language Variation and Change
[Cambridge University Press]
日期:2012-07-01
卷期号:24 (2): 135-178
被引量:396
标识
DOI:10.1017/s0954394512000129
摘要
Abstract What is the explanation for vigorous variation between was and were in plural existential constructions, and what is the optimal tool for analyzing it? Previous studies of this phenomenon have used the variable rule program, a generalized linear model; however, recent developments in statistics have introduced new tools, including mixed-effects models, random forests, and conditional inference trees that may open additional possibilities for data exploration, analysis, and interpretation. In a step-by-step demonstration, we show how this well-known variable benefits from these complementary techniques. Mixed-effects models provide a principled way of assessing the importance of random-effect factors such as the individuals in the sample. Random forests provide information about the importance of predictors, whether factorial or continuous, and do so also for unbalanced designs with high multicollinearity, cases for which the family of linear models is less appropriate. Conditional inference trees straightforwardly visualize how multiple predictors operate in tandem. Taken together, the results confirm that polarity, distance from verb to plural element, and the nature of the DP are significant predictors. Ongoing linguistic change and social reallocation via morphologization are operational. Furthermore, the results make predictions that can be tested in future research. We conclude that variationist research can be substantially enriched by an expanded tool kit.
科研通智能强力驱动
Strongly Powered by AbleSci AI