Adapting Time Series Foundation Models through Data Mixtures
About
Time series foundation models (TSFMs) have become increasingly popular for zero-shot forecasting. However, for a new time series domain not fully covered by the pretraining set, performance can suffer. Therefore, when a practitioner cares about a new domain and has access to a set of related datasets, the question arises: how best to fine-tune a TSFM to improve zero-shot forecasting? A typical approach to this type of problem is to fine-tune a LoRA module on all datasets or separately on each dataset. Tuning a separate module on each dataset allows for the specialisation of the TSFM to different types of data distribution, by selecting differing combinations of per-dataset modules for different time series contexts. However, we find that, using per-dataset modules might not be optimal, since a time series dataset can contain data from several types of distributions, i.e. sub-domains. This can be due to the distribution shifting or having differing distributions for different dimensions of the time series. Hence, we propose MixFT which re-divides the data using Bayesian mixtures into sets that best represent the sub-domains present in the data, and fine-tunes separately on each of these sets. This re-division of the data ensures that each set is more homogeneous, leading to fine-tuned modules focused on specific sub-domains. Our experiments show that MixFT performs better than per-dataset methods and when fine-tuning a single module on all the data. This suggests that by re-partitioning the data to represent sub-domains we can better specialise TSFMs to improve zero-shot forecasting.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Time Series Forecasting | ETTm2 | -- | 382 | |
| Short-term forecasting | M4 Quarterly | MASE7.998 | 141 | |
| Short-term forecasting | M4 Monthly | MASE7.638 | 125 | |
| Time Series Forecasting | ETTh2 | MASE1.387 | 66 | |
| Time Series Forecasting | M4 Daily | MASE7.238 | 31 | |
| Time Series Forecasting | GIFT-Eval bizitobs-application-60 | MASE0.989 | 27 | |
| Univariate Time Series Forecasting | Us_births | MASE0.942 | 19 | |
| Time Series Forecasting | BizITObs-L2C | MASE3.083 | 14 | |
| Time Series Forecasting | CloudD1 | MASE1.485 | 14 | |
| Time Series Forecasting | CloudD2 | MASE1.343 | 14 |