Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Fast Training of Mixture-of-Experts for Time Series Forecasting via Expert Loss Integration

About

We propose a novel adaptive Mixture-of-Experts (MoE) framework for time series forecasting that enhances expert specialization by incorporating expert-specific loss information directly into the training process. Notably, the overall objective comprises the base forecasting loss and expert-specific losses, allowing expert-level prediction errors to jointly shape training alongside the global forecasting loss. This framework is further combined with a partial online learning strategy, enabling incremental updates of both the gating mechanism and expert parameters. This approach significantly reduces computational cost by eliminating the need for repeated full model retraining. By integrating expert-level loss awareness with efficient online optimization, the proposed method achieves improved learning efficiency while maintaining strong predictive performance. Empirical results across economic, tourism, and energy datasets with varying frequencies demonstrate that the proposed approach generally outperforms both statistical methods and state-of-the-art neural network models, such as Transformers and WaveNet, in forecasting accuracy and computational efficiency. Furthermore, ablation studies confirm the effectiveness of the expert-specific loss integration strategy, highlighting its contribution to enhancing predictive performance.

Btissame El Mahtout, Florian Ziel• 2026

Related benchmarks

TaskDatasetResultRank
Time Series Forecastingdominick
MASE0.001
30
Time Series ForecastingElectricity Weekly
Median MASE0.651
13
Time Series ForecastingSaugeen River Flow
Median MASE1.415
13
Time Series ForecastingElectricity Weekly
Median RMSE6.93e+3
13
Time Series ForecastingM4 Hourly
MASE1.34
13
Time Series ForecastingSaugeen River Flow
MASE1.41
13
Time Series ForecastingSaugeen River Flow
Mean MAE21.33
13
Time Series ForecastingTourism Monthly
Mean MAE1.81e+3
13
Time Series ForecastingTourism Monthly
Mean RMSE2.27e+3
13
Time Series ForecastingElectricity Weekly
Median MAE5.80e+3
13
Showing 10 of 34 rows

Other info

Follow for update