A Simple State Space Model Excels at Multivariate Time Series Classification
About
Structured state space models (SSMs) have recently emerged as a promising foundation for sequence modeling, with Mamba-based architectures demonstrating strong performance through input-dependent state transitions, albeit at considerable complexity. However, their application to time-series classification (TSC) has been largely limited to Mamba-style architectures, leaving the broader SSM design space underexplored. We present the first systematic study spanning diagonal SSMs (S4D) and input-dependent SSMs (Mamba family) on large-scale TSC benchmarks, asking whether such complexity is necessary for top performance. Our results reveal a surprising finding: S4D consistently outperforms Mamba-based variants in both accuracy and efficiency, challenging the assumption that increased complexity translates to meaningful gains in TSC. Building on this, we introduce MS4, lightweight modifications to S4D via a linear input projection and channel-mixing mechanism, and MS4N, a normalized variant that stabilizes state dynamics with negligible overhead. Evaluated on 59 datasets across MONSTER (up to 60 million samples, 50K timesteps, 82 classes) and the UEA benchmark, against 15 baselines, MS4 and MS4N consistently outperform Mamba-based models while remaining more efficient, and MS4N matches or surpasses competing deep learning models that are roughly 2x and 10x larger in parameters. These results position lightweight structured SSMs as a compelling alternative to scaling complexity for TSC.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Multivariate Time Series Classification | UEA multivariate time-series archive (test) | Ethanol Concentration Score70.9 | 47 | |
| Time-series classification | PAMAP2 | -- | 40 | |
| Time-series classification | AudioMNIST-DS | Error Rate5 | 14 | |
| Time-series classification | DREAMERA | Error Rate0.476 | 14 | |
| Time-series classification | WhaleSounds | Error Rate22.8 | 14 | |
| Time-series classification | AudioMnist | Error Rate1.6 | 14 | |
| Time-series classification | CrowdSourced | Error Rate31.4 | 14 | |
| Time-series classification | LakeIce | Error Rate0.3 | 14 | |
| Time-series classification | LenDB | Error Rate5.4 | 14 | |
| Time-series classification | WISDM | Error Rate0.132 | 14 |