Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Informer: Beyond Efficient Transformer for Long Sequence Time-Series Forecasting

About

Many real-world applications require the prediction of long sequence time-series, such as electricity consumption planning. Long sequence time-series forecasting (LSTF) demands a high prediction capacity of the model, which is the ability to capture precise long-range dependency coupling between output and input efficiently. Recent studies have shown the potential of Transformer to increase the prediction capacity. However, there are several severe issues with Transformer that prevent it from being directly applicable to LSTF, including quadratic time complexity, high memory usage, and inherent limitation of the encoder-decoder architecture. To address these issues, we design an efficient transformer-based model for LSTF, named Informer, with three distinctive characteristics: (i) a $ProbSparse$ self-attention mechanism, which achieves $O(L \log L)$ in time complexity and memory usage, and has comparable performance on sequences' dependency alignment. (ii) the self-attention distilling highlights dominating attention by halving cascading layer input, and efficiently handles extreme long input sequences. (iii) the generative style decoder, while conceptually simple, predicts the long time-series sequences at one forward operation rather than a step-by-step way, which drastically improves the inference speed of long-sequence predictions. Extensive experiments on four large-scale datasets demonstrate that Informer significantly outperforms existing methods and provides a new solution to the LSTF problem.

Haoyi Zhou, Shanghang Zhang, Jieqi Peng, Shuai Zhang, Jianxin Li, Hui Xiong, Wancai Zhang• 2020

Related benchmarks

TaskDatasetResultRank
Multivariate ForecastingETTh1
MSE0.577
645
Time Series ForecastingETTh1
MSE0.144
601
Time Series ForecastingETTh2
MSE0.253
438
Multivariate Time-series ForecastingETTm1
MSE0.323
433
Time Series ForecastingETTm2
MSE3.658
382
Long-term time-series forecastingETTh1
MAE0.562
351
Long-term time-series forecastingWeather
MSE0.3
348
Multivariate long-term forecastingETTh1
MSE0.865
344
Multivariate ForecastingETTh2
MSE0.72
341
Time Series ForecastingETTm1
MSE0.109
334
Showing 10 of 446 rows
...

Other info

Code

Follow for update