Introducing Spectral Attention for Long-Range Dependency in Time Series Forecasting
About
Sequence modeling faces challenges in capturing long-range dependencies across diverse tasks. Recent linear and transformer-based forecasters have shown superior performance in time series forecasting. However, they are constrained by their inherent inability to effectively address long-range dependencies in time series data, primarily due to using fixed-size inputs for prediction. Furthermore, they typically sacrifice essential temporal correlation among consecutive training samples by shuffling them into mini-batches. To overcome these limitations, we introduce a fast and effective Spectral Attention mechanism, which preserves temporal correlations among samples and facilitates the handling of long-range information while maintaining the base model structure. Spectral Attention preserves long-period trends through a low-pass filter and facilitates gradient to flow between samples. Spectral Attention can be seamlessly integrated into most sequence models, allowing models with fixed-sized look-back windows to capture long-range dependencies over thousands of steps. Through extensive experiments on 11 real-world time series datasets using 7 recent forecasting models, we consistently demonstrate the efficacy of our Spectral Attention mechanism, achieving state-of-the-art results.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Time Series Forecasting | ETTh1 (test) | MSE0.511 | 262 | |
| Time Series Forecasting | ETTm1 (test) | MSE0.431 | 196 | |
| Time Series Forecasting | Traffic (test) | MSE0.428 | 192 | |
| Time Series Forecasting | ETTh2 (test) | MSE0.304 | 140 | |
| Time Series Forecasting | Weather (test) | MSE0.22 | 110 | |
| Time Series Forecasting | ETTm2 (test) | MSE0.21 | 89 | |
| Time Series Forecasting | ECL (test) | MSE0.176 | 43 | |
| Time Series Forecasting | Illness (ILI) (test) | MSE1.974 | 38 | |
| Time Series Forecasting | PEMS03 (test) | MSE0.198 | 14 | |
| Time Series Forecasting | EnergyData (test) | MSE0.786 | 14 |