Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

ms-Mamba: Multi-scale Mamba for Time-Series Forecasting

About

The problem of Time-series Forecasting is generally addressed by recurrent, Transformer-based and the recently proposed Mamba-based architectures. However, existing architectures generally process their input at a single temporal scale, which may be sub-optimal for many tasks where information changes over multiple time scales. In this paper, we introduce a novel architecture called Multi-scale Mamba (ms-Mamba) to address this gap. ms-Mamba incorporates multiple temporal scales by using multiple Mamba blocks with different sampling rates ($\Delta$s). Our experiments on many benchmarks demonstrate that ms-Mamba outperforms state-of-the-art approaches, including the recently proposed Transformer-based and Mamba-based models. For example, on the Solar-Energy dataset, ms-Mamba outperforms its closest competitor S-Mamba (0.229 vs. 0.240 in terms of mean-squared error) while using fewer parameters (3.53M vs. 4.77M), less memory (13.46MB vs. 18.18MB), and less operations (14.93G vs. 20.53G MACs), averaged across four forecast lengths. Codes and models will be made available.

Yusuf Meric Karadag, Ismail Talaz, Ipek Gursel Dino, Sinan Kalkan• 2025

Related benchmarks

TaskDatasetResultRank
Time Series ForecastingETTh1
MSE0.384
836
Time Series ForecastingETTh2
MSE0.291
796
Long-term time-series forecastingWeather
MSE0.163
525
Time Series ForecastingPeMS08
MSE0.073
229
Time Series ForecastingPeMS03
MSE0.066
176
Time Series ForecastingPeMS04
MSE0.072
169
Time Series ForecastingPeMS07
MSE0.06
168
Long-term time-series forecastingExchange
MSE0.086
140
Long-term time-series forecastingSolar Energy
MSE0.195
126
Long-term time-series forecastingElectricity
MSE0.138
75
Showing 10 of 12 rows

Other info

Follow for update