Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Three-Stage Learning Unlocks Strong Performance in Simple Models for Long-Term Time Series Forecasting

About

Recent studies on long-term time series forecasting have shown that simple linear models and MLP-based predictors can achieve strong performance without increasingly complex architectures. However, many competitive baselines still rely on structural priors such as frequency-domain modeling, explicit decomposition, multi-scale mixing, or sophisticated cross-variable interaction modules, while paying less attention to how simple temporal mappings should be trained and organized. In this paper, we propose STAIR, short for Stagewise Temporal Adaptation via Individualization and Residual Learning, a training paradigm for long-term time series forecasting that aims to unlock the capacity of simple temporal mapping models without introducing complex architectural modules. STAIR decomposes forecasting ability into three progressive stages: it first learns common temporal dynamics across variables through a shared temporal mapping, then adapts the shared model to each variable via channel-wise fine-tuning to capture variable-specific patterns, and finally complements the backbone with cross-variable information through residual learning. We further introduce Shared-to-Individual Fine-tuning and alpha-RevIN to mitigate the limitations of strict channel independence and the overly strong normalization prior induced by standard RevIN. This design gradually increases modeling flexibility while keeping the core temporal predictor as a shallow MLP in the main experiments, with linear variants analyzed separately. Experiments on nine long-term forecasting benchmarks show that STAIR matches or outperforms recent strong baselines while preserving a simple temporal backbone, providing a concise and effective modeling perspective for long-term time series forecasting.

Zhenan Yu, Guangxin Jiang, Jin Yang• 2026

Related benchmarks

TaskDatasetResultRank
Long-term time-series forecastingETTh1
MAE0.433
575
Long-term time-series forecastingWeather
MSE0.246
525
Long-term time-series forecastingETTh2
MSE0.372
461
Long-term time-series forecastingETTm1
MSE0.376
461
Long-term time-series forecastingETTm2
MSE0.271
455
Long-term time-series forecastingTraffic
MSE0.466
427
Time Series ForecastingExchange
MSE0.08
227
Long-term time-series forecastingExchange
MSE0.315
140
Long-term time-series forecastingElectricity
MSE0.174
114
Time Series Forecastingsolar
MSE0.198
106
Showing 10 of 16 rows

Other info

Follow for update