TimeMAE: Self-Supervised Representations of Time Series with Decoupled Masked Autoencoders
About
Learning transferable representations from unlabeled time series is crucial for improving performance in data-scarce classification. Existing self-supervised methods often operate at the point level and rely on unidirectional encoding, leading to low semantic density and a mismatch between pre-training and downstream optimization. In this paper, we propose TimeMAE, a self-supervised framework that reformulates masked modeling for time series via semantic unit elevation and decoupled representation learning. Instead of modeling individual time steps, TimeMAE segments time series into non-overlapping sub-series to form semantically enriched units, enabling more informative masked reconstruction while reducing computational cost. To address the representation discrepancy introduced by masking, we design a decoupled masked autoencoder that separately encodes visible and masked regions, avoiding artificial masked tokens in the main encoder. To guide pre-training, we introduce two complementary objectives: masked codeword classification, which discretizes sub-series semantics via a learned tokenizer and masked representation regression, which aligns continuous representations through a momentum-updated target encoder. Extensive experiments on five datasets demonstrate that TimeMAE outperforms competitive baselines, particularly in label-scarce scenarios and transfer learning scenarios.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Multivariate Forecasting | ETTh1 | MSE0.395 | 686 | |
| Time Series Forecasting | ETTh2 | MSE0.455 | 561 | |
| Time Series Forecasting | Weather | MSE0.295 | 295 | |
| Time Series Forecasting | Exchange | MSE0.468 | 199 | |
| Time Series Forecasting | Electricity | MSE0.225 | 114 | |
| Time Series Forecasting | ETTm2 | MSE0.33 | 53 | |
| Classification | Synthetic Dynamical Systems Lorenz, Thomas, and Hindmarsh-Rose (test) | Accuracy96.29 | 40 | |
| Classification | ECG | Accuracy25.46 | 30 | |
| Multivariate time series prediction | Electricity | MSE0.133 | 24 | |
| Classification | PPG (1% labels) | Accuracy44.4 | 19 |