Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

TSMixer: An All-MLP Architecture for Time Series Forecasting

About

Real-world time-series datasets are often multivariate with complex dynamics. To capture this complexity, high capacity architectures like recurrent- or attention-based sequential deep learning models have become popular. However, recent work demonstrates that simple univariate linear models can outperform such deep learning models on several commonly used academic benchmarks. Extending them, in this paper, we investigate the capabilities of linear models for time-series forecasting and present Time-Series Mixer (TSMixer), a novel architecture designed by stacking multi-layer perceptrons (MLPs). TSMixer is based on mixing operations along both the time and feature dimensions to extract information efficiently. On popular academic benchmarks, the simple-to-implement TSMixer is comparable to specialized state-of-the-art models that leverage the inductive biases of specific benchmarks. On the challenging and large scale M5 benchmark, a real-world retail dataset, TSMixer demonstrates superior performance compared to the state-of-the-art alternatives. Our results underline the importance of efficiently utilizing cross-variate and auxiliary information for improving the performance of time series forecasting. We present various analyses to shed light into the capabilities of TSMixer. The design paradigms utilized in TSMixer are expected to open new horizons for deep learning-based time series forecasting. The implementation is available at https://github.com/google-research/google-research/tree/master/tsmixer

Si-An Chen, Chun-Liang Li, Nate Yoder, Sercan O. Arik, Tomas Pfister• 2023

Related benchmarks

TaskDatasetResultRank
Multivariate long-term series forecastingWeather (test)
MSE0.189
269
Multivariate long-term series forecastingTraffic (test)
MSE0.409
219
Multivariate long-term series forecastingElectricity (test)
MSE0.171
166
Multivariate long-term series forecastingETTm2 (test)
MSE0.18
150
Multivariate long-term series forecastingExchange (test)
MSE0.139
145
Multivariate long-term forecastingETTm1 (test)
MSE0.326
134
Multivariate long-term forecastingETTh1 (test)
MSE0.376
77
Multivariate long-term forecastingETTh2 (test)
MSE0.305
76
Cooling load forecastingDCData 100% train data 1.0 (test)
MSE0.0134
48
Cooling load forecastingDCData 50% train data 1.0 (test)
MSE0.0402
44
Showing 10 of 42 rows

Other info

Code

Follow for update