Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

SEMPO: Lightweight Foundation Models for Time Series Forecasting

About

The recent boom of large pre-trained models witnesses remarkable success in developing foundation models (FMs) for time series forecasting. Despite impressive performance across diverse downstream forecasting tasks, existing time series FMs possess massive network architectures and require substantial pre-training on large-scale datasets, which significantly hinders their deployment in resource-constrained environments. In response to this growing tension between versatility and affordability, we propose SEMPO, a novel lightweight foundation model that requires pretraining on relatively small-scale data, yet exhibits strong general time series forecasting. Concretely, SEMPO comprises two key modules: 1) energy-aware SpEctral decomposition module, that substantially improves the utilization of pre-training data by modeling not only the high-energy frequency signals but also the low-energy yet informative frequency signals that are ignored in current methods; and 2) Mixture-of-PrOmpts enabled Transformer, that learns heterogeneous temporal patterns through small dataset-specific prompts and adaptively routes time series tokens to prompt-based experts for parameter-efficient model adaptation across different datasets and domains. Equipped with these modules, SEMPO significantly reduces both pre-training data scale and model size, while achieving strong generalization. Extensive experiments on two large-scale benchmarks covering 16 datasets demonstrate the superior performance of SEMPO in both zero-shot and few-shot forecasting scenarios compared with state-of-the-art methods. Code and data are available at https://github.com/mala-lab/SEMPO.

Hui He, Kun Yi, Yuanchi Ma, Qi Zhang, Zhendong Niu, Guansong Pang• 2025

Related benchmarks

TaskDatasetResultRank
Multivariate ForecastingETTh1
MSE0.41
645
Multivariate Time-series ForecastingETTm1
MSE0.503
433
Multivariate ForecastingETTh2
MSE0.341
341
Multivariate Time-series ForecastingETTm2
MSE0.286
334
Multivariate Time-series ForecastingWeather
MSE0.248
276
Anomaly DetectionSMD--
217
Multivariate Time-series ForecastingTraffic
MSE0.466
200
Multivariate Time-series ForecastingECL
MSE0.196
49
Multivariate long-term forecastingETTm1 T=96 (test)
MSE0.466
39
Time Series Anomaly DetectionSMAP
Affiliation F166.18
29
Showing 10 of 20 rows

Other info

Follow for update