Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Lag-Llama: Towards Foundation Models for Probabilistic Time Series Forecasting

About

Over the past years, foundation models have caused a paradigm shift in machine learning due to their unprecedented capabilities for zero-shot and few-shot generalization. However, despite the success of foundation models in modalities such as natural language processing and computer vision, the development of foundation models for time series forecasting has lagged behind. We present Lag-Llama, a general-purpose foundation model for univariate probabilistic time series forecasting based on a decoder-only transformer architecture that uses lags as covariates. Lag-Llama is pretrained on a large corpus of diverse time series data from several domains, and demonstrates strong zero-shot generalization capabilities compared to a wide range of forecasting models on downstream datasets across domains. Moreover, when fine-tuned on relatively small fractions of such previously unseen datasets, Lag-Llama achieves state-of-the-art performance, outperforming prior deep learning approaches, emerging as the best general-purpose model on average. Lag-Llama serves as a strong contender to the current state-of-art in time series forecasting and paves the way for future advancements in foundation models tailored to time series data.

Kashif Rasul, Arjun Ashok, Andrew Robert Williams, Hena Ghonia, Rishika Bhagwatkar, Arian Khorasani, Mohammad Javad Darvishi Bayazi, George Adamopoulos, Roland Riachi, Nadhir Hassen, Marin Bilo\v{s}, Sahil Garg, Anderson Schneider, Nicolas Chapados, Alexandre Drouin, Valentina Zantedeschi, Yuriy Nevmyvaka, Irina Rish• 2023

Related benchmarks

TaskDatasetResultRank
Time Series ForecastingETT1
RMSE0.61
62
Time Series ForecastingNY-B
RMSE2.9
36
Time Series ForecastingNasdaq
RMSE0.24
36
Time Series ForecastingETT2
RMSE0.57
36
Time Series ForecastingNY-T
RMSE12.84
36
Time Series ForecastingPEM-B
RMSE3.9
36
Time Series ForecastingFlu-US
RMSE1.46
36
Time Series ForecastingFlu-Japan
RMSE1.42e+3
36
Anomaly DetectionTSB-AD U
VUS-PR27
34
Time Series Anomaly DetectionIOPS
VUS PR22
21
Showing 10 of 16 rows

Other info

Follow for update