MSTN: A Lightweight and Fast Model for General TimeSeries Analysis
About
Real-world time series often exhibit strong non-stationarity, complex nonlinear dynamics, and behavior expressed across multiple temporal scales, from rapid local fluctuations to slow-evolving long-range trends. However, many contemporary architectures impose rigid, fixed-scale structural priors -- such as patch-based tokenization, predefined receptive fields, or frozen backbone encoders -- which can over-regularize temporal dynamics and limit adaptability to abrupt high-magnitude events. To handle this, we introduce the Multi-scale Temporal Network (MSTN), a hybrid neural architecture grounded in an Early Temporal Aggregation principle. MSTN integrates three complementary components: (i) a multi-scale convolutional encoder that captures fine-grained local structure; (ii) a sequence modeling module that learns long-range dependencies through either recurrent or attention-based mechanisms; and (iii) a self-gated fusion stage incorporating squeeze-excitation and a single dense layer to dynamically reweight and fuse multi-scale representations. This design enables MSTN to flexibly model temporal patterns spanning milliseconds to extended horizons, while avoiding the computational burden typically associated with long-context models. Across extensive benchmarks covering imputation, long term forecasting, short term forecasting, classification, and cross-dataset generalization, MSTN achieves state-of-the-art performance, establishing new best results on 33 of 40 datasets, while remaining lightweight ($\sim$278,520 params for MSTN-BiLSTM and $\sim$950,776 $\approx$ 1M for MSTN-Transformer) and suitable for low-latency inference ($<$1 sec, often in milliseconds), resource-constrained deployment.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Time Series Imputation | ETTm1 | MSE0.002 | 151 | |
| Time Series Imputation | ETTh1 | MSE0.002 | 149 | |
| Time Series Imputation | Weather | MAE0.014 | 143 | |
| Time Series Imputation | ETTm2 | MSE0.003 | 117 | |
| Time Series Imputation | ETTh2 | MSE0.003 | 100 | |
| Physical Activity Recognition | PAMAP2 | Acc99.89 | 55 | |
| Short-term forecasting | PeMS03 | MAE8.21 | 54 | |
| Short-term forecasting | PeMS07 | MAE7.67 | 51 | |
| Time Series Imputation | ECL | MSE0.027 | 45 | |
| Long-term forecasting | PeMS03 | MSE0.01 | 41 |