Decoupled Distillation to Erase: A General Unlearning Method for Any Class-centric Tasks

About

In this work, we present DEcoupLEd Distillation To Erase (DELETE), a general and strong unlearning method for any class-centric task. To derive it, we first propose a theoretical framework that analyzes the general form of the unlearning loss and decomposes it into a forgetting term and a retention term. Using this framework, we show that a class of previous methods can mainly be formulated as a loss that implicitly optimizes the forgetting term but lacks supervision for the retention term. Such a loss disturbs the distribution of the pre-trained model and struggles to preserve knowledge of the remaining classes. To address this, we refine the retention term using "dark knowledge" and propose a mask distillation unlearning method. By applying a mask to separate forgetting logits from retention logits, our approach optimizes the forgetting and refined retention components simultaneously, retaining knowledge of the remaining classes while ensuring thorough forgetting of the target class. Without access to the remaining data or intervention (as used in some works), we achieve state-of-the-art performance across various benchmarks. Moreover, DELETE is a general solution that can be applied to various downstream tasks, including face recognition, backdoor defense, and semantic segmentation, with strong performance.
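The masking idea described above can be illustrated with a small sketch. This is not the authors' implementation: it assumes, purely for illustration, that the frozen pre-trained model acts as a teacher, that the mask sets the forgetting-class logit to negative infinity before the softmax, and that the student is trained with a cross-entropy against the resulting soft target. The function names `masked_teacher_probs` and `masked_distillation_loss` are hypothetical.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def masked_teacher_probs(teacher_logits, forget_class):
    """Soft target built from the teacher with the forgetting class masked out.

    Assumption: the mask replaces the forgetting-class logit with -inf, so the
    target probability of that class is exactly 0 while the relative ordering
    of the remaining classes (the "dark knowledge") is preserved.
    """
    masked = [
        -math.inf if i == forget_class else z
        for i, z in enumerate(teacher_logits)
    ]
    return softmax(masked)


def masked_distillation_loss(student_logits, teacher_logits, forget_class):
    """Cross-entropy between the masked teacher target and the student.

    Minimizing this is equivalent to minimizing the KL divergence from the
    target to the student distribution, up to a constant that does not
    depend on the student.
    """
    target = masked_teacher_probs(teacher_logits, forget_class)
    m = max(student_logits)
    log_z = m + math.log(sum(math.exp(z - m) for z in student_logits))
    # Skip zero-probability target entries (the masked forgetting class).
    return -sum(
        t * (z - log_z)
        for t, z in zip(target, student_logits)
        if t > 0.0
    )


# Hypothetical logits for one forgetting-class sample with three classes.
teacher = [2.0, 1.0, 0.5]  # class 0 is the class to forget
print(masked_teacher_probs(teacher, forget_class=0))
print(masked_distillation_loss(teacher, teacher, forget_class=0))
```

In this sketch a single loss both pushes the student's forgetting-class probability toward zero and keeps its distribution over the remaining classes close to the teacher's, which mirrors the simultaneous forgetting and retention supervision the abstract describes.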

Yu Zhou, Dian Zheng, Qijie Mo, Renjie Lu, Kun-Yu Lin, Wei-Shi Zheng • 2025

Related benchmarks

Task	Dataset	Result
Semantic segmentation	Pascal VOC (test)	--	268
Machine Unlearning	CIFAR-10	Accf: 0.00e+0	45
Machine Unlearning	Tiny-ImageNet (train)	Forgetting Accuracy (Train): 10.6	43
Single-class Unlearning	CIFAR-10	Retain Accuracy: 100	42
Machine Unlearning	Tiny-ImageNet Swin-T (test)	Residual Accuracy: 73.87	28
Machine Unlearning	Tiny ImageNet (test)	Residual Accuracy: 81.07	23
Machine Unlearning	CIFAR-100 (train)	Accuracy ($D_f$): 0.00e+0	19
Machine Unlearning	Tiny-ImageNet 200 classes (train test)	Acctr (Residual): 99.13	13
Machine Unlearning	ToxiGen (train)	Accuracy ($D_f$): 42.86	13
Machine Unlearning	ToxiGen (test)	Accuracy ($D_f$): 39.53	13

Showing 10 of 36 rows
