随机森林
协变量
排名(信息检索)
计算机科学
泊松分布
随机效应模型
泊松回归
统计
预测能力
锦标赛
可能性
数据集
集合(抽象数据类型)
计量经济学
机器学习
数据挖掘
数学
人工智能
逻辑回归
人口
组合数学
程序设计语言
医学
荟萃分析
认识论
人口学
社会学
哲学
内科学
作者
Andreas Groll,Christophe Ley,Gunther Schauberger,Hans Van Eetvelde
出处
期刊:Journal of Quantitative Analysis in Sports
[De Gruyter]
日期:2019-07-10
卷期号:15 (4): 271-287
被引量:40
标识
DOI:10.1515/jqas-2018-0060
摘要
Abstract In this work, we propose a new hybrid modeling approach for the scores of international soccer matches which combines random forests with Poisson ranking methods . While the random forest is based on the competing teams’ covariate information, the latter method estimates ability parameters on historical match data that adequately reflect the current strength of the teams. We compare the new hybrid random forest model to its separate building blocks as well as to conventional Poisson regression models with regard to their predictive performance on all matches from the four FIFA World Cups 2002–2014. It turns out that by combining the random forest with the team ability parameters from the ranking methods as an additional covariate the predictive power can be improved substantially. Finally, the hybrid random forest is used (in advance of the tournament) to predict the FIFA World Cup 2018. To complete our analysis on the previous World Cup data, the corresponding 64 matches serve as an independent validation data set and we are able to confirm the compelling predictive potential of the hybrid random forest which clearly outperforms all other methods including the betting odds.
科研通智能强力驱动
Strongly Powered by AbleSci AI