
Targeted Unlearning with Single Layer Unlearning Gradient

About

Machine unlearning methods aim to remove sensitive or unwanted content from trained models, but they typically demand extensive model updates at significant computational cost and can degrade model performance on both related and unrelated tasks. We propose Single Layer Unlearning Gradient (SLUG), an efficient method that unlearns targeted information by updating a single critical layer using a one-time gradient computation. SLUG uses layer importance and gradient alignment metrics to identify the optimal layer for targeted information removal while preserving model utility. We demonstrate the effectiveness of SLUG for CLIP, Stable Diffusion, and vision-language models (VLMs) in removing concrete concepts (e.g., identities and objects) and abstract concepts (e.g., artistic styles). On the UnlearnCanvas benchmark, SLUG achieves unlearning performance comparable to existing methods while requiring significantly fewer computational resources. Our proposed approach offers a practical solution for targeted unlearning that is computationally efficient and precise. Our code is available at https://github.com/CSIPlab/SLUG.
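The abstract describes a two-step recipe: score every layer by its importance to the forget objective and by how well its forget gradient aligns with the retain gradient, then apply a one-time gradient update to the single best-scoring layer. The sketch below illustrates that idea on toy Python data structures; the function names, the exact scoring formula (gradient norm relative to weight norm, discounted by gradient alignment), and the gradient-ascent update are assumptions for illustration, not the authors' actual implementation.

```python
# Illustrative sketch of single-layer unlearning. A "model" here is just a
# dict mapping layer names to flat weight lists; gradients are dicts of the
# same shape. All names and formulas are hypothetical stand-ins.

def importance(weights, grads):
    """Layer importance: forget-gradient norm relative to weight norm
    (one plausible formulation of a layer-importance metric)."""
    gnorm = sum(g * g for g in grads) ** 0.5
    wnorm = sum(w * w for w in weights) ** 0.5
    return gnorm / (wnorm + 1e-12)

def alignment(forget_grads, retain_grads):
    """Cosine similarity between forget and retain gradients; low alignment
    suggests updating this layer is less likely to hurt retained utility."""
    dot = sum(f * r for f, r in zip(forget_grads, retain_grads))
    fn = sum(f * f for f in forget_grads) ** 0.5
    rn = sum(r * r for r in retain_grads) ** 0.5
    return dot / (fn * rn + 1e-12)

def select_layer(model, forget_grads, retain_grads):
    """Pick the layer with high forget importance and low forget/retain
    gradient alignment."""
    scores = {}
    for name, weights in model.items():
        imp = importance(weights, forget_grads[name])
        align = alignment(forget_grads[name], retain_grads[name])
        scores[name] = imp * (1.0 - abs(align))
    return max(scores, key=scores.get)

def slug_step(model, forget_grads, layer, step_size):
    """One-time update of a single layer: gradient ascent on the forget
    loss, leaving every other layer untouched."""
    model[layer] = [w + step_size * g
                    for w, g in zip(model[layer], forget_grads[layer])]
    return model
```

On a toy two-layer model where one layer's forget gradient is orthogonal to its retain gradient, `select_layer` would pick that layer and `slug_step` would modify only it, which is the efficiency argument the abstract makes: one gradient computation, one layer touched.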

Zikui Cai, Yaoteng Tan, M. Salman Asif • 2024

Related benchmarks

| Task | Dataset | Result | Rank |
|---|---|---|---|
| Style Unlearning | UnlearnCanvas | UA: 0.8629 | 25 |
| Machine Unlearning | ImageNet | Forget Accuracy: 68.64 | 15 |
| Machine Unlearning | LAION 400M | Forget Accuracy: 48.46 | 15 |
| Object Unlearning | UnlearnCanvas | Unlearning Accuracy (UA): 75.43 | 12 |
| Machine Unlearning | UnlearnCanvas SD-V1.5 | Time (s): 39 | 7 |
| Object Forgetting | UnlearnCanvas SD-V3 | Unlearning Accuracy (UA): 85.44 | 7 |
| Object Forgetting | UnlearnCanvas SD-V1.5 | Unlearning Accuracy (UA): 75.43 | 7 |
| Style Forgetting | UnlearnCanvas SD-V1.5 | Unlearning Accuracy (UA): 86.29 | 7 |
| Style Forgetting | UnlearnCanvas SD-V3 | Unlearning Accuracy (UA): 88.2 | 7 |
