Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

MANGO: Meta-Adaptive Network Gradient Optimization for Online Continual Learning

About

In Online Continual Learning (OCL), a neural network sequentially learns from a non-stationary data stream in a single-pass with access only to a limited memory replay buffer. This contrasts sharply with off-line continual learning where training is multiple epoch dependent on large datasets. The main challenge faced by OCL is to overcome catastrophic forgetting of past tasks (stability) while learning new ones efficiently (plasticity). Existing methods counter forgetting via replay-based rehearsal, output level distillation, fixed regularization, or meta-learning on the current data. However, these methods have limitations: rehearsal introduces a stored sample bias; distillation operates on output-distributions without modulating parameter updates; fixed-regularization penalizes parameters irrespective of sensitivity; stream-only meta-learning lacks a feedback controlled parameter update. We propose Meta-Adaptive Network Gradient Optimization (MANGO), an OCL framework that balances stability-plasticity via gradient-gating and meta-learned regularization. Gradient-gating scales parameter updates based on sensitivity, preventing destructive updates. Meta-learned regularization adapts stability coefficients, evaluating the effect of parameter update on replay. In MANGO, replay acts as both a training signal and a forgetting evaluator. We evaluated our method on three standard OCL benchmark datasets. MANGO outperforms strong baselines, achieving state-of-the-art results with consistent performance across replay sizes. In domain incremental learning on CLEAR-10 and class incremental learning on CIFAR-100 and Tiny-ImageNet, it achieves highest accuracy among all baselines and achieves positive Backward Transfer, overcoming forgetting on CLEAR-10.

Ankita Awasthi, Marco Apolinario, Kaushik Roy• 2026

Related benchmarks

TaskDatasetResultRank
Task-Incremental LearningCIFAR-100
Accuracy54
41
Class-incremental learningCIFAR100 (test)
BWT (Backward Transfer)-35.7
35
Class-incremental learningCIFAR-100 Split
Average Accuracy20.64
31
Class-incremental learningCIFAR-100 (test)--
30
Class-incremental learningTiny ImageNet (test)--
24
Class-incremental learningTiny ImageNet Split
Accuracy25.15
17
Task-Incremental LearningTiny-ImageNet
Accuracy33.24
17
Domain-incremental learningCLEAR-10
Accuracy67.67
13
Showing 8 of 8 rows

Other info

Follow for update