Super-Linear: A Lightweight Pretrained Mixture of Linear Experts for Time Series Forecasting
About
Time series forecasting (TSF) is critical in domains like energy, finance, healthcare, and logistics, requiring models that generalize across diverse datasets. Large pre-trained models such as Chronos and Time-MoE show strong zero-shot (ZS) performance but suffer from high computational costs. In this work, we introduce Super-Linear, a lightweight and scalable mixture-of-experts (MoE) model for general forecasting. It replaces deep architectures with simple frequency-specialized linear experts, trained on resampled data across multiple frequency regimes. A lightweight spectral gating mechanism dynamically selects relevant experts, enabling efficient, accurate forecasting. Despite its simplicity, Super-Linear demonstrates strong performance across benchmarks, while substantially improving efficiency, robustness to sampling rates, and interpretability. The implementation of Super-Linear is available at: \href{https://github.com/azencot-group/SuperLinear}{https://github.com/azencot-group/SuperLinear}.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Time Series Forecasting | ETTh1 | -- | 601 | |
| Time Series Forecasting | ETTh2 | -- | 438 | |
| Time Series Forecasting | ETTm2 | -- | 382 | |
| Time Series Forecasting | ETTm1 | MAE0.389 | 66 | |
| Time Series Forecasting | GIFT-Eval (test) | MASE85.7 | 34 | |
| Time Series Forecasting | LTSF TSLib (test) | ETTh1 Error0.415 | 21 | |
| Time Series Forecasting | Electricity | MAE0.267 | 8 | |
| Time Series Forecasting | Weather | MAE0.275 | 8 | |
| Time Series Forecasting | LTSF Average | Avg MAE0.343 | 8 |