阿卡克信息准则
过度拟合
贝叶斯信息准则
自回归模型
系列(地层学)
贝叶斯概率
交叉验证
选型
统计
计算机科学
信息标准
计量经济学
机器学习
数学
人工智能
人工神经网络
生物
古生物学
摘要
Abstract Vector autoregressive (VAR) modelling is widely employed in psychology for time series analyses of dynamic processes. However, the typically short time series in psychological studies can lead to overfitting of VAR models, impairing their predictive ability on unseen samples. Cross‐validation (CV) methods are commonly recommended for assessing the predictive ability of statistical models. However, it is unclear how the performance of CV is affected by characteristics of time series data and the fitted models. In this simulation study, we examine the ability of two CV methods, namely,10‐fold CV and blocked CV, in estimating the prediction errors of three time series models with increasing complexity (person‐mean, AR, and VAR), and evaluate how their performance is affected by data characteristics. We then compare these CV methods to the traditional methods using the Akaike (AIC) and Bayesian (BIC) information criteria in their accuracy of selecting the most predictive models. We find that CV methods tend to underestimate prediction errors of simpler models, but overestimate prediction errors of VAR models, particularly when the number of observations is small. Nonetheless, CV methods, especially blocked CV, generally outperform the AIC and BIC. We conclude our study with a discussion on the implications of the findings and provide helpful guidelines for practice.
科研通智能强力驱动
Strongly Powered by AbleSci AI