Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

MTS-Mixers: Multivariate Time Series Forecasting via Factorized Temporal and Channel Mixing

About

Multivariate time series forecasting has been widely used in various practical scenarios. Recently, Transformer-based models have shown significant potential in forecasting tasks due to the capture of long-range dependencies. However, recent studies in the vision and NLP fields show that the role of attention modules is not clear, which can be replaced by other token aggregation operations. This paper investigates the contributions and deficiencies of attention mechanisms on the performance of time series forecasting. Specifically, we find that (1) attention is not necessary for capturing temporal dependencies, (2) the entanglement and redundancy in the capture of temporal and channel interaction affect the forecasting performance, and (3) it is important to model the mapping between the input and the prediction sequence. To this end, we propose MTS-Mixers, which use two factorized modules to capture temporal and channel dependencies. Experimental results on several real-world datasets show that MTS-Mixers outperform existing Transformer-based models with higher efficiency.

Zhe Li, Zhongwen Rao, Lujia Pan, Zenglin Xu• 2023

Related benchmarks

TaskDatasetResultRank
Multivariate long-term forecastingETTh1
MSE0.461
344
Multivariate long-term series forecastingWeather
MSE0.254
288
Multivariate long-term forecastingTraffic
MSE0.539
159
Time Series ForecastingPeMS03
MSE0.117
82
Time Series ForecastingPeMS07
MSE0.134
82
Multivariate long-term forecastingETTh1 T=720 (test)
MSE0.51
51
Multivariate long-term forecastingETTh1 T=96 (test)
MSE0.397
48
Short-term forecastingPeMS04
MSE0.129
47
Short-term forecastingPeMS08
MSE0.186
47
Multivariate long-term forecastingETTh1 T=192 (test)
MSE0.452
37
Showing 10 of 51 rows

Other info

Follow for update