Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Ister: Linear Transformer for Efficient Multivariate Time Series Forecasting

About

Transformer-based models have achieved remarkable success in multivariate time series forecasting (MTSF) by capturing long-range dependencies. However, their widespread adoption is hindered by the quadratic computational complexity of self-attention, which limits scalability on high-dimensional sequences. To address this challenge, we propose the Inverted Seasonal-Trend Decomposition Transformer (Ister), a novel architecture that enhances both predictive accuracy and computational efficiency. Central to Ister is Dot-attention, a linear-complexity attention mechanism that replaces conventional multi-head self-attention with element-wise dot-product operations to model inter-series dependencies. Furthermore, we introduce an inverted seasonal-trend decomposition strategy that isolates periodic components, enabling the model to focus learning on periodic patterns, thereby improving the performance of channel alignment. Extensive experiments across several real-world benchmarks demonstrate that Ister consistently achieves state-of-the-art performance. Code is available at https://github.com/macovaseas/Ister.

Fanpu Cao, Shu Yang, Zhengjian Chen, Ye Liu, Laizhong Cui• 2024

Related benchmarks

TaskDatasetResultRank
Multivariate ForecastingETTh1
MSE0.438
830
Multivariate Time-series ForecastingETTm1
MSE0.386
686
Multivariate Time-series ForecastingETTm2
MSE0.279
539
Multivariate Time-series ForecastingWeather
MSE0.243
409
Multivariate Time-series ForecastingTraffic
MSE0.399
310
Multivariate Time-series ForecastingETTh2
MSE0.349
198
Multivariate Time-series ForecastingPeMS04
MSE0.106
107
Multivariate Time-series ForecastingElectricity
MAE0.26
105
Multivariate Time-series ForecastingPeMS07
MSE0.092
80
Multivariate Time-series ForecastingPeMS08
MSE0.136
71
Showing 10 of 11 rows

Other info

Follow for update