Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Evaluating the Discrimination Ability of Proper Multivariate Scoring Rules

About

Proper scoring rules are commonly applied to quantify the accuracy of distribution forecasts. Given an observation they assign a scalar score to each distribution forecast, with the the lowest expected score attributed to the true distribution. The energy and variogram scores are two rules that have recently gained some popularity in multivariate settings because their computation does not require a forecast to have parametric density function and so they are broadly applicable. Here we conduct a simulation study to compare the discrimination ability between the energy score and three variogram scores. Compared with other studies, our simulation design is more realistic because it is supported by a historical data set containing commodity prices, currencies and interest rates, and our data generating processes include a diverse selection of models with different marginal distributions, dependence structure, and calibration windows. This facilitates a comprehensive comparison of the performance of proper scoring rules in different settings. To compare the scores we use three metrics: the mean relative score, error rate and a generalised discrimination heuristic. Overall, we find that the variogram score with parameter p=0.5 outperforms the energy score and the other two variogram scores.

Carol Alexander, Michael Coulon, Yang Han, Xiaochun Meng• 2021

Related benchmarks

TaskDatasetResultRank
Probabilistic Electricity Price ForecastingGerman electricity price day-ahead market n = 736 (First sub-sample: 2018-12-27 to 2020-12-31)
RMSE7.751
32
Electricity Price ForecastingGerman day-ahead market Panel A Full sample (2018-12-27 to 2023-12-31)
RMSE31.268
16
Electricity Price ForecastingGerman day-ahead market Panel C: Second sub-sample n = 1095 (2021-01-01 to 2023-12-31)
RMSE39.931
16
Probabilistic Electricity Price ForecastingGerman day-ahead market 2021-2023 (Second sub-sample)
RMSE39.931
16
Showing 4 of 4 rows

Other info

Follow for update