Unlocking the Power of LSTM for Long Term Time Series Forecasting

About

Traditional recurrent neural network architectures, such as the long short-term memory network (LSTM), have historically held a prominent role in time series forecasting (TSF). While the recently introduced sLSTM for natural language processing (NLP) adds exponential gating and memory mixing that benefit long-term sequential learning, its potential short-memory issue is a barrier to applying sLSTM directly to TSF. To address this, we propose a simple yet efficient algorithm named P-sLSTM, which builds on sLSTM by incorporating patching and channel independence. These modifications substantially enhance sLSTM's performance in TSF, achieving state-of-the-art results. Furthermore, we provide theoretical justifications for our design and conduct extensive comparative and analytical experiments to fully validate the efficiency and superior performance of our model.
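The two modifications the abstract names, patching and channel independence, can be illustrated outside the model itself. The sketch below is a minimal NumPy rendering of the idea, not the authors' implementation: each channel of a multivariate series is treated as an independent univariate series (channel independence), and each such series is split into overlapping fixed-length segments (patches) that a sequence model like sLSTM would then consume as tokens. Function names, the patch length, and the stride are illustrative assumptions.

```python
import numpy as np

def make_patches(series, patch_len, stride):
    # Split a univariate series of length L into overlapping patches.
    # Returns an array of shape (num_patches, patch_len).
    num_patches = (len(series) - patch_len) // stride + 1
    return np.stack([series[i * stride : i * stride + patch_len]
                     for i in range(num_patches)])

def channel_independent_patches(x, patch_len, stride):
    # Channel independence: each of the C channels of x (shape (L, C))
    # is patched as its own univariate series; a downstream model would
    # share weights across channels rather than mixing them.
    return np.stack([make_patches(x[:, c], patch_len, stride)
                     for c in range(x.shape[1])])  # (C, num_patches, patch_len)

# Toy multivariate series: lookback L=96, C=7 channels.
x = np.arange(96 * 7, dtype=float).reshape(96, 7)
patches = channel_independent_patches(x, patch_len=16, stride=8)
print(patches.shape)  # (7, 11, 16): 7 channels, 11 patches of length 16
```

Patching shortens the sequence the recurrent core must traverse (here 96 steps become 11 patch tokens), which is one plausible way the design mitigates a short-memory issue over long horizons.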

Yaxuan Kong, Zepu Wang, Yuqi Nie, Tian Zhou, Stefan Zohren, Yuxuan Liang, Peng Sun, Qingsong Wen (2024)

Related benchmarks

Task                      Dataset       Metric  Result  Rank
Time Series Forecasting   ETTh2         MSE     0.349   561
Long-term forecasting     ETTh1         MSE     0.438   365
Time Series Forecasting   ECL           MSE     0.171   211
Forecasting               Traffic       MSE     0.417   68
Time Series Forecasting   ETTm2         MSE     0.269   53
Time Series Forecasting   ETTm1         MSE     0.374   29
Forecasting               solar         MAE     0.261   28
Forecasting               Weather       MAE     0.256   26
GPP prediction            FLUXNET       RMSE    1.94    19
CH4 prediction            FLUXNET CH4   RMSE    62.84   14

(Showing 10 of 20 rows)
