Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Bayesian continual learning and forgetting in neural networks

About

Biological synapses effortlessly balance memory retention and flexibility, yet artificial neural networks still struggle with the extremes of catastrophic forgetting and catastrophic remembering. Here, we introduce Metaplasticity from Synaptic Uncertainty (MESU), a Bayesian framework that updates network parameters according their uncertainty. This approach allows a principled combination of learning and forgetting that ensures that critical knowledge is preserved while unused or outdated information is gradually released. Unlike standard Bayesian approaches -- which risk becoming overly constrained, and popular continual-learning methods that rely on explicit task boundaries, MESU seamlessly adapts to streaming data. It further provides reliable epistemic uncertainty estimates, allowing out-of-distribution detection, the only computational cost being to sample the weights multiple times to provide proper output statistics. Experiments on image-classification benchmarks demonstrate that MESU mitigates catastrophic forgetting, while maintaining plasticity for new tasks. When training 200 sequential permuted MNIST tasks, MESU outperforms established continual learning techniques in terms of accuracy, capability to learn additional tasks, and out-of-distribution data detection. Additionally, due to its non-reliance on task boundaries, MESU outperforms conventional learning techniques on the incremental training of CIFAR-100 tasks consistently in a wide range of scenarios. Our results unify ideas from metaplasticity, Bayesian inference, and Hessian-based regularization, offering a biologically-inspired pathway to robust, perpetual learning.

Djohan Bonnet, Kellian Cottart, Tifenn Hirtzlin, Tarcisius Januel, Thomas Dalgaty, Elisa Vianello, Damien Querlioz• 2025

Related benchmarks

TaskDatasetResultRank
Out-of-Distribution DetectionOpenLORIS-Object (held-out toy class)
Aleatoric AUC1
24
Lifelong Object RecognitionOpenLORIS-Object (12-task stream)
Mean Accuracy87.84
24
Online Continual LearningPermuted MNIST 1000-tasks (last 5 tasks)
Mean Accuracy (5 Tasks)92.99
16
Out-of-Distribution DetectionMNIST vs Fashion-MNIST 1000-tasks Permuted
OOD Detection AUC95
15
Image ClassificationPermuted-MNIST Single-task
Accuracy (1 Task)96.1
8
Showing 5 of 5 rows

Other info

Follow for update