Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

One Fits All:Power General Time Series Analysis by Pretrained LM

About

Although we have witnessed great success of pre-trained models in natural language processing (NLP) and computer vision (CV), limited progress has been made for general time series analysis. Unlike NLP and CV where a unified model can be used to perform different tasks, specially designed approach still dominates in each time series analysis task such as classification, anomaly detection, forecasting, and few-shot learning. The main challenge that blocks the development of pre-trained model for time series analysis is the lack of a large amount of data for training. In this work, we address this challenge by leveraging language or CV models, pre-trained from billions of tokens, for time series analysis. Specifically, we refrain from altering the self-attention and feedforward layers of the residual blocks in the pre-trained language or image model. This model, known as the Frozen Pretrained Transformer (FPT), is evaluated through fine-tuning on all major types of tasks involving time series. Our results demonstrate that pre-trained models on natural language or images can lead to a comparable or state-of-the-art performance in all main time series analysis tasks, as illustrated in Figure 1. We also found both theoretically and empirically that the self-attention module behaviors similarly to principle component analysis (PCA), an observation that helps explains how transformer bridges the domain gap and a crucial step towards understanding the universality of a pre-trained transformer.The code is publicly available at https://github.com/DAMO-DI-ML/One_Fits_All.

Tian Zhou, PeiSong Niu, Xue Wang, Liang Sun, Rong Jin• 2023

Related benchmarks

TaskDatasetResultRank
Multivariate ForecastingETTh1
MSE0.427
645
Time Series ForecastingETTh1
MSE0.376
601
Time Series ForecastingETTh2
MSE0.354
438
Multivariate Time-series ForecastingETTm1
MSE0.352
433
Time Series ForecastingETTm2
MSE0.266
382
Long-term time-series forecastingETTh1
MAE0.426
351
Long-term time-series forecastingWeather
MSE0.228
348
Multivariate long-term forecastingETTh1
MSE0.376
344
Time Series ForecastingETTm1
MSE0.352
334
Multivariate Time-series ForecastingETTm2
MSE0.266
334
Showing 10 of 275 rows
...

Other info

Code

Follow for update