Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Federated Unlearning via Class-Discriminative Pruning

About

We explore the problem of selectively forgetting categories from trained CNN classification models in the federated learning (FL). Given that the data used for training cannot be accessed globally in FL, our insights probe deep into the internal influence of each channel. Through the visualization of feature maps activated by different channels, we observe that different channels have a varying contribution to different categories in image classification. Inspired by this, we propose a method for scrubbing the model clean of information about particular categories. The method does not require retraining from scratch, nor global access to the data used for training. Instead, we introduce the concept of Term Frequency Inverse Document Frequency (TF-IDF) to quantize the class discrimination of channels. Channels with high TF-IDF scores have more discrimination on the target categories and thus need to be pruned to unlearn. The channel pruning is followed by a fine-tuning process to recover the performance of the pruned model. Evaluated on CIFAR10 dataset, our method accelerates the speed of unlearning by 8.9x for the ResNet model, and 7.9x for the VGG model under no degradation in accuracy, compared to retraining from scratch. For CIFAR100 dataset, the speedups are 9.9x and 8.4x, respectively. We envision this work as a complementary block for FL towards compliance with legal and ethical criteria.

Junxiao Wang, Song Guo, Xin Xie, Heng Qi• 2021

Related benchmarks

TaskDatasetResultRank
Backdoor Feature UnlearningFMNIST Du (unlearn)
Accuracy72.19
6
Backdoor Feature UnlearningMNIST Du split (unlearn)
Accuracy69.37
6
Backdoor Feature UnlearningCIFAR-10 Du (unlearn)
Accuracy62.75
6
Backdoor Feature UnlearningCIFAR-100 Du (unlearn)
Accuracy57.29
6
Backdoor Feature UnlearningImageNet Du (unlearn)
Accuracy62.39
6
Sensitive Feature UnlearningDiabetes
ASR81.91
6
Backdoor Feature UnlearningCIFAR-20 Du (unlearn)
Accuracy50.17
6
Sensitive Feature UnlearningCelebA
Attack Success Rate75.36
6
Sensitive Feature UnlearningAdult
ASR72.83
6
Sensitive Feature UnlearningIMDB
ASR (%)83.67
6
Showing 10 of 32 rows

Other info

Follow for update