数学
分位数
分位数回归
极值理论
广义帕累托分布
统计
协变量
条件概率分布
梯度升压
外推法
二项回归
计量经济学
偏差(统计)
回归分析
随机森林
计算机科学
人工智能
作者
Jasper Velthoen,Clément Dombry,Juan‐Juan Cai,Sebastian Engelke
出处
期刊:Extremes
[Springer Nature]
日期:2023-07-21
卷期号:26 (4): 639-667
被引量:40
标识
DOI:10.1007/s10687-023-00473-x
摘要
Abstract Extreme quantile regression provides estimates of conditional quantiles outside the range of the data. Classical quantile regression performs poorly in such cases since data in the tail region are too scarce. Extreme value theory is used for extrapolation beyond the range of observed values and estimation of conditional extreme quantiles. Based on the peaks-over-threshold approach, the conditional distribution above a high threshold is approximated by a generalized Pareto distribution with covariate dependent parameters. We propose a gradient boosting procedure to estimate a conditional generalized Pareto distribution by minimizing its deviance. Cross-validation is used for the choice of tuning parameters such as the number of trees and the tree depths. We discuss diagnostic plots such as variable importance and partial dependence plots, which help to interpret the fitted models. In simulation studies we show that our gradient boosting procedure outperforms classical methods from quantile regression and extreme value theory, especially for high-dimensional predictor spaces and complex parameter response surfaces. An application to statistical post-processing of weather forecasts with precipitation data in the Netherlands is proposed.
科研通智能强力驱动
Strongly Powered by AbleSci AI