
Is Retain Set All You Need in Machine Unlearning? Restoring Performance of Unlearned Models with Out-Of-Distribution Images

About

In this paper, we introduce Selective-distillation for Class and Architecture-agnostic unleaRning (SCAR), a novel approximate unlearning method. SCAR efficiently eliminates specific information while preserving the model's test accuracy without using a retain set, a key component of state-of-the-art approximate unlearning algorithms. Our approach uses a modified Mahalanobis distance to guide the unlearning of the feature vectors of the instances to be forgotten, aligning them with the nearest wrong-class distribution. Moreover, we propose a distillation-trick mechanism that distills the knowledge of the original model into the unlearning model using out-of-distribution images, retaining the original model's test performance without any retain set. Importantly, we also propose a self-forget version of SCAR that unlearns without access to the forget set. We experimentally verify the effectiveness of our method on three public datasets, comparing it with state-of-the-art methods. Our method outperforms methods that operate without a retain set and is comparable to the best methods that rely on one.

Jacopo Bonato, Marco Cotogni, Luigi Sabetta • 2024
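
The abstract describes two training signals: a modified Mahalanobis distance that pushes each forget-set feature toward the nearest wrong-class distribution, and a distillation loss computed on out-of-distribution images so that no retain set is needed to preserve test accuracy. Below is a minimal PyTorch sketch of how these two losses might look; all names (mahalanobis_forget_loss, class_means, class_prec, the temperature T) are our own illustrative assumptions, and the exact form of the paper's "modified" Mahalanobis distance is not reproduced here.

```python
# Hedged sketch of SCAR's two losses, based only on the abstract above.
import torch
import torch.nn.functional as F


def mahalanobis_forget_loss(features, labels, class_means, class_prec):
    """Push forget-set features toward the nearest *wrong* class
    distribution, measured by a squared Mahalanobis distance.

    features:    (B, D) feature vectors of the samples to forget
    labels:      (B,)   their original, to-be-forgotten labels
    class_means: (C, D) per-class feature means from the original model
    class_prec:  (C, D, D) per-class precision (inverse covariance) matrices
    """
    diffs = features.unsqueeze(1) - class_means.unsqueeze(0)        # (B, C, D)
    # Squared Mahalanobis distance from each sample to every class.
    d2 = torch.einsum("bcd,cde,bce->bc", diffs, class_prec, diffs)  # (B, C)
    # Mask out each sample's own class so the minimum lands on a wrong class.
    own = F.one_hot(labels, num_classes=d2.size(1)).bool()
    d2 = d2.masked_fill(own, float("inf"))
    # Minimizing this aligns each feature with its nearest wrong-class
    # distribution, as described in the abstract.
    return d2.min(dim=1).values.mean()


def ood_distillation_loss(student_logits, teacher_logits, T=2.0):
    """Standard KL knowledge distillation, here computed on
    out-of-distribution images so no retain set is required."""
    log_p_student = F.log_softmax(student_logits / T, dim=1)
    p_teacher = F.softmax(teacher_logits / T, dim=1)
    return F.kl_div(log_p_student, p_teacher, reduction="batchmean") * T ** 2
```

In training, one gradient step would presumably combine the two terms, e.g. loss = L_forget + λ · L_distill, with the frozen original model as the teacher on the OOD batch; the paper's actual weighting and distance variant may differ.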

Related benchmarks

Task                    | Dataset   | Result                           | Rank
Class Unlearning        | CIFAR-10  | Retain Accuracy: 93.57           | 39
Single-class Unlearning | CIFAR-100 | ACCr: 75.42                      | 28
Single-class Unlearning | MNIST     | Accuracy Retention (ACCr): 0.992 | 28
Class Unlearning        | CIFAR-10  | U-LiRA Accuracy: 92.38           | 12
