Variational Bayesian Unlearning
About
This paper studies the problem of approximately unlearning a Bayesian model from a small subset of the training data to be erased. We frame this problem as one of minimizing the Kullback-Leibler divergence between the approximate posterior belief of model parameters after directly unlearning from erased data vs. the exact posterior belief from retraining with remaining data. Using the variational inference (VI) framework, we show that it is equivalent to minimizing an evidence upper bound which trades off between fully unlearning from erased data vs. not entirely forgetting the posterior belief given the full data (i.e., including the remaining data); the latter prevents catastrophic unlearning that can render the model useless. In model training with VI, only an approximate (instead of exact) posterior belief given the full data can be obtained, which makes unlearning even more challenging. We propose two novel tricks to tackle this challenge. We empirically demonstrate our unlearning methods on Bayesian models such as sparse Gaussian process and logistic regression using synthetic and real-world datasets.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Hierarchical Unlearning | MedForget 1.0 (Forget) | Gen Score57.25 | 72 | |
| Knowledge Retention | MMLU (full) | MMLU Accuracy41.1 | 60 | |
| Machine Unlearning | MNIST | Model Accuracy99.37 | 56 | |
| Classification | MLLMU-Bench Forget Set | Accuracy60 | 51 | |
| Generation | MLLMU-Bench Forget Set | Rouge Score55.7 | 37 | |
| Multimodal Machine Unlearning Evaluation | MLLMU-Bench Forget Set | Classification Accuracy39.69 | 36 | |
| Multimodal Machine Unlearning | Retain Set | Classification Accuracy40 | 35 | |
| Classification | MLLMU-Bench (Retain Set) | Accuracy66.87 | 32 | |
| Cloze | MLLMU-Bench Forget Set | Cloze Accuracy23.91 | 32 | |
| Classification | MLLMU-Bench (test) | Accuracy50.83 | 32 |