Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Mitigating Privacy Risk via Forget Set-Free Unlearning

About

Training machine learning models requires the storage of large datasets, which often contain sensitive or private data. Storing data is associated with a number of potential risks which increase over time, such as database breaches and malicious adversaries. Machine unlearning is the study of methods to efficiently remove the influence of training data subsets from previously-trained models. Existing unlearning methods typically require direct access to the "forget set" -- the data to be forgotten-and organisations must retain this data for unlearning rather than deleting it immediately upon request, increasing risks associated with the forget set. We introduce partially-blind unlearning -- utilizing auxiliary information to unlearn without explicit access to the forget set. We also propose a practical framework Reload, a partially-blind method based on gradient optimization and structured weight sparsification to operationalize partially-blind unlearning. We show that Reload efficiently unlearns, approximating models retrained from scratch, and outperforms several forget set-dependent approaches. On language models, Reload unlearns entities using <0.025% of the retain set and <7% of model weights in <8 minutes on Llama2-7B. In the corrective case, Reload achieves unlearning even when only 10% of corrupted data is identified.

Aviraj Newatia, Michael Cooper, Viet Nguyen, Rahul G. Krishnan• 2026

Related benchmarks

TaskDatasetResultRank
Class UnlearningCIFAR-10
Retain Accuracy-0.2
60
Machine UnlearningTOFU (5%)
Forget Quality0.5453
59
Machine UnlearningCIFAR-100 In Class Random Forgetting
RA (Utility Retention)99.56
40
Machine UnlearningTOFU (10%)
Forget Quality (FQ)0.7
37
PoisoningCIFAR100
Poisoning Cost0.24
36
PoisoningCIFAR10
Attack Cost0.29
36
Machine UnlearningTOFU (1%)
Forget Quality (FQ)0.4046
36
Machine UnlearningTOFU Forget10 Phi-1.5B model
Forget Quality (FQ)0.468
24
Machine UnlearningCIFAR-10 30% random data forgetting--
24
Poisoning UnlearningCIFAR-10
Accuracy Retention-7.55
21
Showing 10 of 30 rows

Other info

Follow for update