Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

ms-Mamba: Multi-scale Mamba for Time-Series Forecasting

About

The problem of Time-series Forecasting is generally addressed by recurrent, Transformer-based and the recently proposed Mamba-based architectures. However, existing architectures generally process their input at a single temporal scale, which may be sub-optimal for many tasks where information changes over multiple time scales. In this paper, we introduce a novel architecture called Multi-scale Mamba (ms-Mamba) to address this gap. ms-Mamba incorporates multiple temporal scales by using multiple Mamba blocks with different sampling rates ($\Delta$s). Our experiments on many benchmarks demonstrate that ms-Mamba outperforms state-of-the-art approaches, including the recently proposed Transformer-based and Mamba-based models. For example, on the Solar-Energy dataset, ms-Mamba outperforms its closest competitor S-Mamba (0.229 vs. 0.240 in terms of mean-squared error) while using fewer parameters (3.53M vs. 4.77M), less memory (13.46MB vs. 18.18MB), and less operations (14.93G vs. 20.53G MACs), averaged across four forecast lengths. Codes and models will be made available.

Yusuf Meric Karadag, Ismail Talaz, Ipek Gursel Dino, Sinan Kalkan• 2025

Related benchmarks

TaskDatasetResultRank
Time Series ForecastingETTh1
MSE0.384
729
Time Series ForecastingETTh2
MSE0.291
561
Long-term time-series forecastingWeather
MSE0.163
448
Time Series ForecastingPeMS08
MSE0.073
212
Time Series ForecastingPeMS03
MSE0.066
176
Time Series ForecastingPeMS07
MSE0.06
168
Time Series ForecastingPeMS04
MSE0.072
157
Long-term time-series forecastingSolar Energy
MSE0.195
107
Long-term time-series forecastingExchange
MSE0.086
79
Time Series ForecastingTraffic
MSE0.375
75
Showing 10 of 12 rows

Other info

Follow for update