# Meta-learning framework with applications to zero-shot time-series forecasting
## About
Can meta-learning discover generic ways of processing time series (TS) from a diverse dataset so as to greatly improve generalization on new TS coming from different datasets? This work provides positive evidence for this using a broad meta-learning framework that we show subsumes many existing meta-learning algorithms. Our theoretical analysis suggests that residual connections act as a meta-learning adaptation mechanism: they generate a subset of task-specific parameters conditioned on a given TS input, gradually expanding the expressive power of the architecture on the fly. A linearization analysis shows that the same mechanism can be interpreted as a sequential update of the final linear layer. Our empirical results on a wide range of data underline the importance of the identified meta-learning mechanisms for successful zero-shot univariate forecasting: it is viable to train a neural network on a source TS dataset and deploy it on a different target TS dataset without retraining, with performance at least as good as that of state-of-practice univariate forecasting models.
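The residual-connections-as-adaptation idea can be illustrated with a minimal NumPy sketch of a doubly residual stack: each block sees only the residual left unexplained by earlier blocks, so later blocks are effectively conditioned on the input in a task-specific way, and the forecast is the sum of the blocks' contributions. This is a simplified illustration of the principle, not the paper's actual architecture; all class and parameter names here are illustrative.

```python
import numpy as np

rng = np.random.default_rng(0)


def relu(x):
    return np.maximum(x, 0.0)


class ResidualBlock:
    """One block: maps the current residual to a backcast (the part of
    the input it explains) and a forecast contribution."""

    def __init__(self, input_size, horizon, hidden=16):
        self.W1 = rng.normal(0.0, 0.1, (input_size, hidden))
        self.Wb = rng.normal(0.0, 0.1, (hidden, input_size))  # backcast head
        self.Wf = rng.normal(0.0, 0.1, (hidden, horizon))     # forecast head

    def __call__(self, x):
        h = relu(x @ self.W1)
        return h @ self.Wb, h @ self.Wf


def forecast(blocks, x):
    """Doubly residual stack: each block receives the residual left by
    the previous blocks, and forecast contributions are summed. The
    residual path is the input-conditioned adaptation mechanism."""
    residual = x
    total = 0.0
    for block in blocks:
        backcast, f = block(residual)
        residual = residual - backcast  # remove what this block explained
        total = total + f               # accumulate the forecast
    return total


if __name__ == "__main__":
    x = rng.normal(size=12)  # lookback window of length 12
    blocks = [ResidualBlock(input_size=12, horizon=6) for _ in range(3)]
    print(forecast(blocks, x).shape)  # forecast over a 6-step horizon
```

Because each block's input depends on what earlier blocks already explained of the particular series, the stack's effective computation varies per input, which is the sense in which the residual connections act as on-the-fly adaptation.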
## Related benchmarks
| Task | Dataset | Metric | Value | Rank |
|---|---|---|---|---|
| Time Series Forecasting | ETTh1 | MSE | 0.177 | 601 |
| Time Series Forecasting | ETTh2 | MSE | 0.48 | 438 |
| Time Series Forecasting | Weather | MSE | 0.014 | 223 |
| Time Series Forecasting | ECL | MSE | 0.909 | 183 |
| Time Series Forecasting | Exchange | MSE | 0.023 | 176 |
| Time Series Forecasting | Traffic | MSE | 2.913 | 145 |
| Time Series Forecasting | Illness | MSE | 1.301 | 42 |