MTS-Mixers: Multivariate Time Series Forecasting via Factorized Temporal and Channel Mixing

About

Multivariate time series forecasting has been widely used in various practical scenarios. Recently, Transformer-based models have shown significant potential in forecasting tasks due to the capture of long-range dependencies. However, recent studies in the vision and NLP fields show that the role of attention modules is not clear, which can be replaced by other token aggregation operations. This paper investigates the contributions and deficiencies of attention mechanisms on the performance of time series forecasting. Specifically, we find that (1) attention is not necessary for capturing temporal dependencies, (2) the entanglement and redundancy in the capture of temporal and channel interaction affect the forecasting performance, and (3) it is important to model the mapping between the input and the prediction sequence. To this end, we propose MTS-Mixers, which use two factorized modules to capture temporal and channel dependencies. Experimental results on several real-world datasets show that MTS-Mixers outperform existing Transformer-based models with higher efficiency.

Zhe Li, Zhongwen Rao, Lujia Pan, Zenglin Xu• 2023

Related benchmarks

Task	Dataset	Result
Multivariate long-term forecasting	ETTh1	MSE0.461	472
Multivariate long-term series forecasting	Weather	MSE0.254	425
Multivariate long-term forecasting	Traffic	MSE0.539	190
Time Series Forecasting	PeMS03	MSE0.117	176
Time Series Forecasting	PeMS07	MSE0.134	168
Short-term forecasting	PeMS04	MSE0.129	58
Short-term forecasting	PeMS08	MSE0.186	58
Multivariate long-term forecasting	ETTh1 T=720 (test)	MSE0.51	51
Multivariate long-term forecasting	ETTh1 T=96 (test)	MSE0.397	48
Multivariate long-term forecasting	ETTh1 T=192 (test)	MSE0.452	37

Showing 10 of 51 rows

Other info

Follow for update

@wizwand_team Discord