WeatherBench 2: A benchmark for the next generation of data-driven global weather models
About
WeatherBench 2 is an update to the global, medium-range (1-14 day) weather forecasting benchmark proposed by Rasp et al. (2020), designed with the aim to accelerate progress in data-driven weather modeling. WeatherBench 2 consists of an open-source evaluation framework, publicly available training, ground truth and baseline data as well as a continuously updated website with the latest metrics and state-of-the-art models: https://sites.research.google/weatherbench. This paper describes the design principles of the evaluation framework and presents results for current state-of-the-art physical and data-driven weather models. The metrics are based on established practices for evaluating weather forecasts at leading operational weather centers. We define a set of headline scores to provide an overview of model performance. In addition, we also discuss caveats in the current evaluation setup and challenges for the future of data-driven weather forecasting.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| 10m Wind Speed Forecasting | ERA5 | RMSE2.43 | 20 | |
| 2m Temperature Forecasting | ERA5 | RMSE2.61 | 20 | |
| 500hPa Geopotential Forecasting | ERA5 | RMSE820 | 20 | |
| 850hPa Temperature Forecasting | ERA5 | RMSE (850hPa Temp)3.41 | 20 | |
| Precipitation Regression | WeatherBench 2 (WB2) (test) | RMSE (0-6h)1.29 | 3 |