Ister: Linear Transformer for Efficient Multivariate Time Series Forecasting

About

Transformer-based models have achieved remarkable success in multivariate time series forecasting (MTSF) by capturing long-range dependencies. However, their widespread adoption is hindered by the quadratic computational complexity of self-attention, which limits scalability on high-dimensional sequences. To address this challenge, we propose the Inverted Seasonal-Trend Decomposition Transformer (Ister), a novel architecture that enhances both predictive accuracy and computational efficiency. Central to Ister is Dot-attention, a linear-complexity attention mechanism that replaces conventional multi-head self-attention with element-wise dot-product operations to model inter-series dependencies. Furthermore, we introduce an inverted seasonal-trend decomposition strategy that isolates periodic components, enabling the model to focus learning on periodic patterns, thereby improving the performance of channel alignment. Extensive experiments across several real-world benchmarks demonstrate that Ister consistently achieves state-of-the-art performance. Code is available at https://github.com/macovaseas/Ister.

Fanpu Cao, Shu Yang, Zhengjian Chen, Ye Liu, Laizhong Cui• 2024

Related benchmarks

Task	Dataset	Result
Multivariate Forecasting	ETTh1	MSE0.438	909
Multivariate Time-series Forecasting	ETTm1	MSE0.386	742
Multivariate Time-series Forecasting	ETTm2	MSE0.279	593
Multivariate Time-series Forecasting	Weather	MSE0.243	466
Multivariate Time-series Forecasting	Traffic	MSE0.399	323
Multivariate Time-series Forecasting	ETTh2	MSE0.349	219
Multivariate Time-series Forecasting	Electricity	MAE0.26	109
Multivariate Time-series Forecasting	PeMS04	MSE0.106	107
Multivariate Time-series Forecasting	PeMS07	MSE0.092	82
Multivariate Time-series Forecasting	PeMS08	MSE0.136	75

Showing 10 of 11 rows

Other info

Follow for update

@wizwand_team Discord