ms-Mamba: Multi-scale Mamba for Time-Series Forecasting
About
Time-series forecasting is generally addressed by recurrent, Transformer-based, and the recently proposed Mamba-based architectures. However, existing architectures generally process their input at a single temporal scale, which may be sub-optimal for many tasks where information changes over multiple time scales. In this paper, we introduce a novel architecture called Multi-scale Mamba (ms-Mamba) to address this gap. ms-Mamba incorporates multiple temporal scales by using multiple Mamba blocks with different sampling rates ($\Delta$s). Our experiments on many benchmarks demonstrate that ms-Mamba outperforms state-of-the-art approaches, including the recently proposed Transformer-based and Mamba-based models. For example, on the Solar-Energy dataset, averaged across four forecast lengths, ms-Mamba outperforms its closest competitor S-Mamba (0.229 vs. 0.240 in terms of mean-squared error) while using fewer parameters (3.53M vs. 4.77M), less memory (13.46MB vs. 18.18MB), and fewer operations (14.93G vs. 20.53G MACs). Code and models will be made available.
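To illustrate why different sampling rates correspond to different temporal scales, the following is a minimal NumPy sketch, not the authors' implementation. It assumes a plain diagonal linear state-space model discretized with zero-order hold, run once per fixed $\Delta$, with the branch outputs simply averaged. Real Mamba blocks use input-dependent (selective) parameters, and how ms-Mamba combines its blocks is not stated in the abstract; the names `ssm_scan` and `multi_scale_ssm` and the averaging step are hypothetical choices made for illustration.

```python
import numpy as np


def ssm_scan(x, a, b, c, delta):
    """Run a diagonal linear SSM over a 1-D sequence with step size delta.

    Continuous system: h'(t) = a * h(t) + b * x(t),  y(t) = c . h(t)
    Zero-order-hold discretization (elementwise, since a is diagonal):
        a_bar = exp(delta * a)
        b_bar = (a_bar - 1) / a * b
    """
    a_bar = np.exp(delta * a)
    b_bar = (a_bar - 1.0) / a * b
    h = np.zeros_like(a)
    y = np.empty(len(x))
    for t, x_t in enumerate(x):
        h = a_bar * h + b_bar * x_t  # recurrent state update
        y[t] = c @ h                 # linear readout
    return y


def multi_scale_ssm(x, a, b, c, deltas):
    """Hypothetical multi-scale combination: one scan per delta, averaged."""
    outputs = [ssm_scan(x, a, b, c, d) for d in deltas]
    return np.mean(outputs, axis=0)


rng = np.random.default_rng(0)
state_dim = 4
a = -np.arange(1, state_dim + 1, dtype=float)  # negative values keep the system stable
b = np.ones(state_dim)
c = rng.standard_normal(state_dim)

x = np.sin(np.linspace(0.0, 8.0 * np.pi, 96))  # toy input of length 96
y = multi_scale_ssm(x, a, b, c, deltas=(0.01, 0.1, 1.0))
print(y.shape)
```

In this sketch a small $\Delta$ makes `a_bar` close to 1, so the state decays slowly and retains long-range context, while a large $\Delta$ makes the state forget quickly and track recent inputs. Running several values side by side is one simple way to expose the model to several time scales at once.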
Related benchmarks
| Task | Dataset | Result | Rank |
|---|---|---|---|
| Time Series Forecasting | ETTh1 | MSE 0.384 | 729 |
| Time Series Forecasting | ETTh2 | MSE 0.291 | 561 |
| Long-term time-series forecasting | Weather | MSE 0.163 | 448 |
| Time Series Forecasting | PeMS08 | MSE 0.073 | 212 |
| Time Series Forecasting | PeMS03 | MSE 0.066 | 176 |
| Time Series Forecasting | PeMS07 | MSE 0.06 | 168 |
| Time Series Forecasting | PeMS04 | MSE 0.072 | 157 |
| Long-term time-series forecasting | Solar Energy | MSE 0.195 | 107 |
| Long-term time-series forecasting | Exchange | MSE 0.086 | 79 |
| Time Series Forecasting | Traffic | MSE 0.375 | 75 |