MSTN: A Lightweight and Fast Model for General TimeSeries Analysis
About
Real-world time series often exhibit strong non-stationarity, complex nonlinear dynamics, and behavior expressed across multiple temporal scales, from rapid local fluctuations to slow-evolving long-range trends. However, many contemporary architectures impose rigid, fixed-scale structural priors-such as patch-based tokenization, predefined receptive fields, or frozen backbone encoders-which can over-regularize temporal dynamics and limit adaptability to abrupt high-magnitude events. To handle this, we introduce the Multi-scale Temporal Network (MSTN), a hybrid neural architecture grounded in an Early Temporal Aggregation principle. MSTN integrates three complementary components: (i) a multi-scale convolutional encoder that captures fine-grained local structure; (ii) a sequence modeling module that learns long-range dependencies through either recurrent or attention-based mechanisms; and (iii) a self-gated fusion stage incorporating squeeze-excitation and a single dense layer to dynamically reweight and fuse multi-scale representations. This design enables MSTN to flexibly model temporal patterns spanning milliseconds to extended horizons, while avoiding the computational burden typically associated with long-context models. Across extensive benchmarks covering imputation, long-term forecasting, classification, and cross-dataset generalization, MSTN achieves state-of-the-art performance, establishing new best results on 21 of 27 datasets, while remaining lightweight (~0.40M params for MSTN-BiLSTM and ~1.06M for MSTN-Transformer) and suitable for low-latency inference (<1 sec, often in milliseconds), resource-constrained deployment.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Time Series Imputation | ETTh1 | MSE0.002 | 162 | |
| Time Series Imputation | ETTm1 | MSE0.002 | 159 | |
| Time Series Imputation | Weather | MAE0.014 | 155 | |
| Time Series Imputation | ETTm2 | MSE0.003 | 125 | |
| Time Series Imputation | ETTh2 | MSE0.003 | 108 | |
| Human Activity Recognition | UCI-HAR | Accuracy96.84 | 86 | |
| Short-term forecasting | PeMS03 | MAE8.21 | 65 | |
| Short-term forecasting | PeMS07 | MAE7.67 | 62 | |
| Time Series Imputation | ECL | MSE0.027 | 57 | |
| Physical Activity Recognition | PAMAP2 | Acc99.89 | 55 |