Super-Linear: A Lightweight Pretrained Mixture of Linear Experts for Time Series Forecasting

About

Time series forecasting (TSF) is critical in domains like energy, finance, healthcare, and logistics, requiring models that generalize across diverse datasets. Large pre-trained models such as Chronos and Time-MoE show strong zero-shot (ZS) performance but suffer from high computational costs. In this work, we introduce Super-Linear, a lightweight and scalable mixture-of-experts (MoE) model for general forecasting. It replaces deep architectures with simple frequency-specialized linear experts, trained on resampled data across multiple frequency regimes. A lightweight spectral gating mechanism dynamically selects relevant experts, enabling efficient, accurate forecasting. Despite its simplicity, Super-Linear demonstrates strong performance across benchmarks, while substantially improving efficiency, robustness to sampling rates, and interpretability. The implementation of Super-Linear is available at: https://github.com/azencot-group/SuperLinear

Liran Nochumsohn, Raz Marshanski, Hedi Zisling, Omri Azencot • 2025
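
The abstract describes the architecture only at a high level: linear experts specialized by frequency, combined through a spectral gate. The following is a minimal NumPy sketch of that general idea, not the authors' implementation. The function names (`spectral_gate`, `forecast`), the top-k softmax gating rule, the window sizes, and the random untrained weights are all assumptions made for illustration; the resampling-based training procedure mentioned in the abstract is not shown.

```python
import numpy as np

def spectral_gate(x, gate_w, top_k=2):
    """Hypothetical gate: score experts from the window's magnitude spectrum."""
    # Normalized magnitude spectrum of the mean-removed lookback window.
    spec = np.abs(np.fft.rfft(x - x.mean()))
    spec = spec / (spec.sum() + 1e-8)
    # One logit per expert (a linear gate is an assumption, not from the paper).
    logits = spec @ gate_w
    # Keep the top_k experts and renormalize their weights with a softmax.
    top = np.argsort(logits)[-top_k:]
    w = np.exp(logits[top] - logits[top].max())
    return top, w / w.sum()

def forecast(x, experts, gate_w, top_k=2):
    """Weighted sum of the selected linear experts' forecasts."""
    top, w = spectral_gate(x, gate_w, top_k)
    # Each expert is a single (horizon x lookback) matrix applied to the window.
    return sum(wi * (experts[i] @ x) for i, wi in zip(top, w))

# Illustrative sizes and random (untrained) parameters.
rng = np.random.default_rng(0)
L, H, E = 96, 24, 4                               # lookback, horizon, experts
experts = rng.normal(scale=0.01, size=(E, H, L))  # one linear map per expert
gate_w = rng.normal(size=(L // 2 + 1, E))         # rfft of length L has L//2+1 bins

x = np.sin(2 * np.pi * np.arange(L) / 24)         # toy series with period 24
y_hat = forecast(x, experts, gate_w)
print(y_hat.shape)
```

Because every expert is a single matrix and the gate reads only the spectrum, inspecting which experts receive weight for a given input is straightforward, which is one plausible reading of the interpretability claim in the abstract.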

Related benchmarks

Task	Dataset	Result
Time Series Forecasting	ETTh1	MSE 0.369	836
Time Series Forecasting	ETTh2	--	796
Time Series Forecasting	ETTm2	MSE 0.179	536
Long-term time-series forecasting	ETTh1 (test)	MSE 0.364	410
Time Series Forecasting	ETTh1 (test)	MSE 0.364	398
Time Series Forecasting	ETTh2 (test)	MSE 0.346	250
Time Series Forecasting	Electricity	MSE 0.141	237
Long-term time-series forecasting	Weather (test)	MSE 0.146	223
Long-term time-series forecasting	ETTh2 (test)	MSE 0.272	216
Time Series Forecasting	Traffic	MSE 0.414	211

Showing 10 of 28 rows
