Journal: The journal of prediction markets [University of Buckingham Press]; Date: 2018-12-05; Volume (issue): 12 (2): 85-98; Cited by: 5
Identifier
DOI:10.5750/jpm.v12i2.1608
Abstract
This paper presents research on predicting average National Hockey League attendance. The seasons examined run from the 2013 hockey season through the beginning of the 2017 hockey season. Multiple linear regression and three machine learning algorithms (random forest, M5 prime, and extreme gradient boosting) are used to predict out-of-sample average home game attendance. Extreme gradient boosting produced the lowest out-of-sample root mean square error. The top four predictor variables were the team identifier (team name), the number of Twitter followers (a surrogate for team popularity), median ticket price, and arena capacity.
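The comparison criterion named in the abstract, out-of-sample root mean square error, can be illustrated with a minimal sketch. This is not the authors' code: the function name `rmse`, the model labels `model_a` and `model_b`, and all attendance figures are made-up placeholders chosen only to show how held-out predictions from several models might be scored and ranked.

```python
import math


def rmse(actual, predicted):
    """Root mean square error between two equal-length sequences."""
    if len(actual) != len(predicted) or len(actual) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    squared_errors = [(a - p) ** 2 for a, p in zip(actual, predicted)]
    return math.sqrt(sum(squared_errors) / len(squared_errors))


# Hypothetical held-out average attendance values (NOT data from the paper).
held_out = [17500.0, 18200.0, 15300.0, 19000.0]

# Hypothetical predictions from two unnamed models (placeholders only).
predictions = {
    "model_a": [17000.0, 18900.0, 16000.0, 18100.0],
    "model_b": [17400.0, 18000.0, 15600.0, 18800.0],
}

# Score each model on the held-out values and pick the lowest RMSE.
scores = {name: rmse(held_out, preds) for name, preds in predictions.items()}
best_model = min(scores, key=scores.get)

for name, score in scores.items():
    print(f"{name}: RMSE = {score:.1f}")
print("lowest out-of-sample RMSE:", best_model)
```

Because RMSE is expressed in the same units as the target (here, attendees per game), scores from different models can be compared directly as long as they are computed on the same held-out games.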