
Computationally Budgeted Continual Learning: What Does Matter?

About

Continual Learning (CL) aims to sequentially train models on streams of incoming data that vary in distribution, preserving previous knowledge while adapting to new data. Current CL literature focuses on restricted access to previously seen data, while imposing no constraints on the computational budget for training. This is unreasonable for applications in the wild, where systems are primarily constrained by computational and time budgets, not storage. We revisit this problem with a large-scale benchmark and analyze the performance of traditional CL approaches in a compute-constrained setting, where the effective number of memory samples used in training can be implicitly restricted as a consequence of limited computation. We conduct experiments evaluating various CL sampling strategies, distillation losses, and partial fine-tuning on two large-scale datasets, namely ImageNet2K and Continual Google Landmarks V2, in data-incremental, class-incremental, and time-incremental settings. Through extensive experiments amounting to a total of over 1500 GPU-hours, we find that, in the compute-constrained setting, traditional CL approaches, without exception, fail to outperform a simple minimal baseline that samples uniformly from memory. Our conclusions hold across different numbers of stream time steps, e.g., 20 to 200, and under several computational budgets. This suggests that most existing CL methods are too computationally expensive for realistic budgeted deployment. Code for this project is available at: https://github.com/drimpossible/BudgetCL.
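
The minimal baseline named in the abstract, uniform sampling from memory under a fixed compute budget, can be illustrated with a short sketch. This is not the authors' implementation (see the linked repository for that); the function name `uniform_budgeted_batches` and the parameters `budget_iters` and `batch_size` are hypothetical, and the sketch assumes the budget is expressed as a fixed number of training iterations per stream time step and that all past samples are kept in memory.

```python
import random


def uniform_budgeted_batches(stream, budget_iters, batch_size, seed=0):
    """Yield (time_step, batch) pairs for a budgeted continual-learning loop.

    Assumptions (illustrative only, not taken from the paper's code):
    - `stream` is an iterable of lists, one list of new samples per time step;
    - storage is unconstrained, so every sample seen so far stays in memory;
    - compute is capped at `budget_iters` mini-batches per time step.
    """
    rng = random.Random(seed)
    memory = []
    for t, new_samples in enumerate(stream):
        # Storing data is cheap; only the number of training steps is limited.
        memory.extend(new_samples)
        for _ in range(budget_iters):
            # Draw each mini-batch uniformly (without replacement) from memory.
            k = min(batch_size, len(memory))
            yield t, rng.sample(memory, k)


if __name__ == "__main__":
    toy_stream = [[1, 2, 3], [4, 5]]
    for step, batch in uniform_budgeted_batches(toy_stream, budget_iters=2, batch_size=2):
        # A real training loop would take one gradient step on `batch` here.
        print(step, batch)
```

Because the number of iterations per step is fixed while memory keeps growing, each stored sample is revisited less often over time, which is one way to read the abstract's remark that limited computation implicitly restricts the memory samples used in training.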

Ameya Prabhu, Hasan Abed Al Kader Hammoud, Puneet Dokania, Philip H.S. Torr, Ser-Nam Lim, Bernard Ghanem, Adel Bibi • 2023

Related benchmarks

Task | Dataset | Result | Rank
Time Series Forecasting | ETTm2 | -- | 382
Short-term forecasting | M4 Quarterly | MASE 8.083 | 141
Short-term forecasting | M4 Monthly | MASE 7.735 | 125
Time Series Forecasting | ETTh2 | MASE 1.386 | 66
Time Series Forecasting | M4 Daily | MASE 7.249 | 31
Time Series Forecasting | GIFT-Eval bizitobs-application-60 | MASE 1.173 | 27
Univariate Time Series Forecasting | Us_births | MASE 0.979 | 19
Time Series Forecasting | CloudD2 | MASE 1.32 | 14
Time Series Forecasting | CloudD1 | MASE 1.463 | 14
Time Series Forecasting | BizITObs-L2C | MASE 3.124 | 14
