MSDformer: Multi-scale Discrete Transformer For Time Series Generation
About
Discrete Token Modeling (DTM), which employs vector quantization techniques, has demonstrated remarkable success in modeling non-natural language modalities, particularly in time series generation. While our prior work SDformer established the first DTM-based framework to achieve state-of-the-art performance in this domain, two critical limitations persist in existing DTM approaches: 1) their inability to capture multi-scale temporal patterns inherent to complex time series data, and 2) the absence of theoretical foundations to guide model optimization. To address these challenges, we proposes a novel multi-scale DTM-based time series generation method, called Multi-Scale Discrete Transformer (MSDformer). MSDformer employs a multi-scale time series tokenizer to learn discrete token representations at multiple scales, which jointly characterize the complex nature of time series data. Subsequently, MSDformer applies a multi-scale autoregressive token modeling technique to capture the multi-scale patterns of time series within the discrete latent space. Theoretically, we validate the effectiveness of the DTM method and the rationality of MSDformer through the rate-distortion theorem. Comprehensive experiments demonstrate that MSDformer significantly outperforms state-of-the-art methods. Both theoretical analysis and experimental results demonstrate that incorporating multi-scale information and modeling multi-scale patterns can substantially enhance the quality of generated time series in DTM-based approaches. Code is available at this repository:https://github.com/kkking-kk/MSDformer.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Time-series generation | Energy | Discriminative Score0.01 | 45 | |
| Time-series generation | ETTh | Predictive Score0.118 | 36 | |
| Time-series generation | Stocks | Discriminative Score0.005 | 29 | |
| Time-series generation | Sines | Predictive Score0.093 | 9 | |
| Time-series generation | ETTh | Inference Time0.3 | 3 | |
| Time-series generation | Sines | Inference Time0.95 | 3 | |
| Time-series generation | Stocks | Inference Time0.96 | 3 | |
| Time-series generation | Mujoco | Inference Time0.97 | 3 | |
| Time-series generation | fMRI | Inference Time1.4 | 3 |