Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Zero-Shot Class Unlearning in CLIP with Synthetic Samples

About

Machine unlearning is a crucial area of research. It is driven by the need to remove sensitive information from models to safeguard individuals' right to be forgotten under rigorous regulations such as GDPR. In this work, we focus on unlearning within CLIP, a dual vision-language encoder model trained on a massive dataset of image-text pairs using contrastive loss. To achieve forgetting we expand the application of Lipschitz regularization to the multimodal context of CLIP. Specifically, we ensure the smoothing of both visual and textual embeddings associated with the class intended to be forgotten relative to the perturbation introduced to the samples from that class. Additionally, importantly, we remove the necessity for real forgetting data by generating synthetic samples through gradient ascent maximizing the target class. Our forgetting procedure is iterative, where we track accuracy on a synthetic forget set and stop when accuracy falls below a chosen threshold. We employ a selective layers update strategy based on their average absolute gradient value to mitigate over-forgetting. We validate our approach on several standard datasets and provide thorough ablation analysis and comparisons with previous work.

A. Kravets, V. Namboodiri• 2024

Related benchmarks

TaskDatasetResultRank
Image ClassificationObjectNet
Accuracy27.66
251
Image ClassificationDomainNet
Accuracy (ClipArt)90.5
238
Image ClassificationFood
Accuracy79.11
152
Image ClassificationSTL
Top-1 Acc96.42
89
Image ClassificationOxford Flowers (test)--
85
Continual UnlearningImageNet-1K
Retention Score52.77
60
Single-class UnlearningCIFAR-10
Retain Accuracy75.67
42
Machine UnlearningImageNet
Utility Preservation53.85
33
Machine UnlearningTiny-ImageNet--
28
Image ClassificationTarget Classes Forget Set (test)
BF97.1
24
Showing 10 of 38 rows

Other info

Follow for update