Ister: Linear Transformer for Efficient Multivariate Time Series Forecasting
About
Transformer-based models have achieved remarkable success in multivariate time series forecasting (MTSF) by capturing long-range dependencies. However, their widespread adoption is hindered by the quadratic computational complexity of self-attention, which limits scalability on high-dimensional sequences. To address this challenge, we propose the Inverted Seasonal-Trend Decomposition Transformer (Ister), a novel architecture that enhances both predictive accuracy and computational efficiency. Central to Ister is Dot-attention, a linear-complexity attention mechanism that replaces conventional multi-head self-attention with element-wise dot-product operations to model inter-series dependencies. Furthermore, we introduce an inverted seasonal-trend decomposition strategy that isolates periodic components, enabling the model to focus learning on periodic patterns, thereby improving the performance of channel alignment. Extensive experiments across several real-world benchmarks demonstrate that Ister consistently achieves state-of-the-art performance. Code is available at https://github.com/macovaseas/Ister.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Multivariate Forecasting | ETTh1 | MSE0.438 | 830 | |
| Multivariate Time-series Forecasting | ETTm1 | MSE0.386 | 686 | |
| Multivariate Time-series Forecasting | ETTm2 | MSE0.279 | 539 | |
| Multivariate Time-series Forecasting | Weather | MSE0.243 | 409 | |
| Multivariate Time-series Forecasting | Traffic | MSE0.399 | 310 | |
| Multivariate Time-series Forecasting | ETTh2 | MSE0.349 | 198 | |
| Multivariate Time-series Forecasting | PeMS04 | MSE0.106 | 107 | |
| Multivariate Time-series Forecasting | Electricity | MAE0.26 | 105 | |
| Multivariate Time-series Forecasting | PeMS07 | MSE0.092 | 80 | |
| Multivariate Time-series Forecasting | PeMS08 | MSE0.136 | 71 |