Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

TSMixer: Lightweight MLP-Mixer Model for Multivariate Time Series Forecasting

About

Transformers have gained popularity in time series forecasting for their ability to capture long-sequence interactions. However, their high memory and computing requirements pose a critical bottleneck for long-term forecasting. To address this, we propose TSMixer, a lightweight neural architecture exclusively composed of multi-layer perceptron (MLP) modules for multivariate forecasting and representation learning on patched time series. Inspired by MLP-Mixer's success in computer vision, we adapt it for time series, addressing challenges and introducing validated components for enhanced accuracy. This includes a novel design paradigm of attaching online reconciliation heads to the MLP-Mixer backbone, for explicitly modeling the time-series properties such as hierarchy and channel-correlations. We also propose a novel Hybrid channel modeling and infusion of a simple gating approach to effectively handle noisy channel interactions and generalization across diverse datasets. By incorporating these lightweight components, we significantly enhance the learning capability of simple MLP structures, outperforming complex Transformer models with minimal computing usage. Moreover, TSMixer's modular design enables compatibility with both supervised and masked self-supervised learning methods, making it a promising building block for time-series Foundation Models. TSMixer outperforms state-of-the-art MLP and Transformer models in forecasting by a considerable margin of 8-60%. It also outperforms the latest strong benchmarks of Patch-Transformer models (by 1-2%) with a significant reduction in memory and runtime (2-3X). The source code of our model is officially released as PatchTSMixer in the HuggingFace. Model: https://huggingface.co/docs/transformers/main/en/model_doc/patchtsmixer Examples: https://github.com/ibm/tsfm/#notebooks-links

Vijay Ekambaram, Arindam Jati, Nam Nguyen, Phanwadee Sinthong, Jayant Kalagnanam• 2023

Related benchmarks

TaskDatasetResultRank
Multivariate ForecastingETTh1
MSE0.314
645
Time Series ForecastingETTh1
MSE0.412
601
Time Series ForecastingETTh2
MSE0.341
438
Multivariate Time-series ForecastingETTm1
MSE0.291
433
Time Series ForecastingETTm2
MSE0.25
382
Multivariate ForecastingETTh2
MSE0.402
341
Multivariate Time-series ForecastingETTm2
MSE0.164
334
Multivariate long-term series forecastingETTh2
MSE0.276
319
Multivariate Time-series ForecastingWeather
MSE0.088
276
Long-term time-series forecastingETTh1 (test)
MSE0.63
221
Showing 10 of 48 rows

Other info

Follow for update