GIFT-Eval: A Benchmark For General Time Series Forecasting Model Evaluation

About

Time series foundation models excel in zero-shot forecasting, handling diverse tasks without explicit training. However, the advancement of these models has been hindered by the lack of comprehensive benchmarks. To address this gap, we introduce the General Time Series Forecasting Model Evaluation, GIFT-Eval, a pioneering benchmark aimed at promoting evaluation across diverse datasets. GIFT-Eval encompasses 23 datasets over 144,000 time series and 177 million data points, spanning seven domains, 10 frequencies, multivariate inputs, and prediction lengths ranging from short to long-term forecasts. To facilitate the effective pretraining and evaluation of foundation models, we also provide a non-leaking pretraining dataset containing approximately 230 billion data points. Additionally, we provide a comprehensive analysis of 17 baselines, which includes statistical models, deep learning models, and foundation models. We discuss each model in the context of various benchmark characteristics and offer a qualitative analysis that spans both deep learning and foundation models. We believe the insights from this analysis, along with access to this new standard zero-shot time series forecasting benchmark, will guide future developments in time series foundation models. Code, data, and the leaderboard can be found at https://github.com/SalesforceAIResearch/gift-eval .

Taha Aksu, Gerald Woo, Juncheng Liu, Xu Liu, Chenghao Liu, Silvio Savarese, Caiming Xiong, Doyen Sahoo• 2024

Related benchmarks

Task	Dataset	Result
Time Series Forecasting	GIFT-Eval (test)	--	63
Forecasting	GIFT-Eval Multivariate	Relative MAPE1	13
Forecasting	GIFT-Eval L Horizon	Relative MAPE1	13
Forecasting	GIFT-Eval M Horizon	Relative MAPE1	13
Forecasting	GIFT-Eval All	Relative MAPE1	13
Forecasting	M4 hourly 48	Relative MAPE1	13
Forecasting	GIFT-Eval S Horizon	Relative MAPE1	13
Forecasting	Electricity 480	Relative MAPE1	13
Forecasting	Electricity 720	Relative MAPE1	13
Forecasting	GIFT-Eval Univariate	Relative MAPE1	13

Showing 10 of 11 rows

Other info

Follow for update

@wizwand_team Discord