Timer: Generative Pre-trained Transformers Are Large Time Series Models
About
Deep learning has contributed remarkably to the advancement of time series analysis. Still, deep models can encounter performance bottlenecks in real-world data-scarce scenarios, which can be concealed due to the performance saturation with small models on current benchmarks. Meanwhile, large models have demonstrated great powers in these scenarios through large-scale pre-training. Continuous progress has been achieved with the emergence of large language models, exhibiting unprecedented abilities such as few-shot generalization, scalability, and task generality, which are however absent in small deep models. To change the status quo of training scenario-specific small models from scratch, this paper aims at the early development of large time series models (LTSM). During pre-training, we curate large-scale datasets with up to 1 billion time points, unify heterogeneous time series into single-series sequence (S3) format, and develop the GPT-style architecture toward LTSMs. To meet diverse application needs, we convert forecasting, imputation, and anomaly detection of time series into a unified generative task. The outcome of this study is a Time Series Transformer (Timer), which is generative pre-trained by next token prediction and adapted to various downstream tasks with promising capabilities as an LTSM. Code and datasets are available at: https://github.com/thuml/Large-Time-Series-Model.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Time Series Forecasting | ETTh1 | MSE0.438 | 729 | |
| Multivariate Forecasting | ETTh1 | MSE0.418 | 686 | |
| Time Series Forecasting | ETTh2 | MSE0.366 | 561 | |
| Multivariate Time-series Forecasting | ETTm1 | MSE0.352 | 466 | |
| Multivariate long-term forecasting | ETTh1 | MSE0.379 | 394 | |
| Time Series Forecasting | ETTm2 | MSE0.36 | 382 | |
| Multivariate long-term series forecasting | ETTh2 | MSE0.309 | 367 | |
| Multivariate long-term series forecasting | Weather | MSE0.176 | 359 | |
| Multivariate Forecasting | ETTh2 | MSE0.382 | 350 | |
| Time Series Forecasting | ETTh1 (test) | MSE0.412 | 348 |