Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Are Transformers Effective for Time Series Forecasting?

About

Recently, there has been a surge of Transformer-based solutions for the long-term time series forecasting (LTSF) task. Despite the growing performance over the past few years, we question the validity of this line of research in this work. Specifically, Transformers is arguably the most successful solution to extract the semantic correlations among the elements in a long sequence. However, in time series modeling, we are to extract the temporal relations in an ordered set of continuous points. While employing positional encoding and using tokens to embed sub-series in Transformers facilitate preserving some ordering information, the nature of the \emph{permutation-invariant} self-attention mechanism inevitably results in temporal information loss. To validate our claim, we introduce a set of embarrassingly simple one-layer linear models named LTSF-Linear for comparison. Experimental results on nine real-life datasets show that LTSF-Linear surprisingly outperforms existing sophisticated Transformer-based LTSF models in all cases, and often by a large margin. Moreover, we conduct comprehensive empirical studies to explore the impacts of various design elements of LTSF models on their temporal relation extraction capability. We hope this surprising finding opens up new research directions for the LTSF task. We also advocate revisiting the validity of Transformer-based solutions for other time series analysis tasks (e.g., anomaly detection) in the future. Code is available at: \url{https://github.com/cure-lab/LTSF-Linear}.

Ailing Zeng, Muxi Chen, Lei Zhang, Qiang Xu• 2022

Related benchmarks

TaskDatasetResultRank
Multivariate ForecastingETTh1
MSE0.256
645
Time Series ForecastingETTh1
MSE0.375
601
Time Series ForecastingETTh2
MSE0.276
438
Multivariate Time-series ForecastingETTm1
MSE0.259
433
Time Series ForecastingETTm2
MSE0.162
382
Long-term time-series forecastingETTh1
MAE0.31
351
Long-term time-series forecastingWeather
MSE0.048
348
Multivariate long-term forecastingETTh1
MSE0.375
344
Multivariate ForecastingETTh2
MSE0.183
341
Multivariate Time-series ForecastingETTm2
MSE0.144
334
Showing 10 of 757 rows
...

Other info

Code

Follow for update