Towards Privacy-Guaranteed Label Unlearning in Vertical Federated Learning: Few-Shot Forgetting without Disclosure

About

This paper addresses the critical challenge of unlearning in Vertical Federated Learning (VFL), a setting that has received far less attention than its horizontal counterpart. Specifically, we propose the first method tailored to label unlearning in VFL, where labels play a dual role as both essential inputs and sensitive information. To this end, we employ a representation-level manifold mixup mechanism to generate synthetic embeddings for both unlearned and retained samples, providing richer signals for the subsequent gradient-based label forgetting and recovery steps. These augmented embeddings are then subjected to gradient-based label forgetting, which removes the associated label information from the model. To recover performance on the retained data, we introduce a recovery-phase optimization step that refines the remaining embeddings. This design achieves effective label unlearning while maintaining computational efficiency. We validate our method through extensive experiments on diverse datasets, including MNIST, CIFAR-10, CIFAR-100, ModelNet, Brain Tumor MRI, COVID-19 Radiography, and Yahoo Answers, which demonstrate strong efficacy and scalability. Overall, this work establishes a new direction for unlearning in VFL, showing that re-imagining mixup as an efficient mechanism can unlock practical and utility-preserving unlearning. The code is publicly available at https://github.com/bryanhx/Towards-Privacy-Guaranteed-Label-Unlearning-in-Vertical-Federated-Learning
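The two core ideas in the abstract, mixing samples at the embedding level and then forgetting labels with a gradient-based update, can be illustrated with a minimal NumPy sketch. This is not the authors' implementation (see the repository linked above for that). The function names, the linear softmax head, the Beta-distributed mixing coefficient, and the use of gradient ascent on the cross-entropy loss as the forgetting update are all assumptions made for illustration.

```python
import numpy as np

def mixup_embeddings(emb, labels_onehot, alpha=1.0, rng=None):
    """Mix each embedding (and its one-hot label) with a random partner.

    Hypothetical helper: the Beta(alpha, alpha) coefficient follows the
    usual mixup recipe, which is assumed here rather than taken from the paper.
    """
    rng = np.random.default_rng() if rng is None else rng
    lam = rng.beta(alpha, alpha)          # mixing coefficient in [0, 1]
    perm = rng.permutation(len(emb))      # random partner for every sample
    mixed_emb = lam * emb + (1.0 - lam) * emb[perm]
    mixed_lab = lam * labels_onehot + (1.0 - lam) * labels_onehot[perm]
    return mixed_emb, mixed_lab, lam

def softmax(logits):
    """Row-wise softmax with the usual max-subtraction for stability."""
    z = logits - logits.max(axis=1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def cross_entropy(W, emb, soft_labels):
    """Mean cross-entropy of a linear softmax head against soft labels."""
    p = softmax(emb @ W)
    return -np.mean(np.sum(soft_labels * np.log(p + 1e-12), axis=1))

def forgetting_step(W, emb, soft_labels, lr=0.1):
    """One gradient-ASCENT step on the cross-entropy loss.

    Assumption: ascent is used as a stand-in for 'gradient-based label
    forgetting'; the abstract does not spell out the exact update.
    """
    p = softmax(emb @ W)
    grad = emb.T @ (p - soft_labels) / len(emb)   # dLoss/dW
    return W + lr * grad                          # move uphill to forget

# Toy usage: 8 samples to forget, 4-dim embeddings, 3 classes.
rng = np.random.default_rng(0)
emb = rng.normal(size=(8, 4))
labels = np.eye(3)[rng.integers(0, 3, size=8)]
mixed_emb, mixed_lab, lam = mixup_embeddings(emb, labels, rng=rng)

W = np.zeros((4, 3))
loss_before = cross_entropy(W, mixed_emb, mixed_lab)
W_forgot = forgetting_step(W, mixed_emb, mixed_lab)
loss_after = cross_entropy(W_forgot, mixed_emb, mixed_lab)
```

In this toy setup the loss on the mixed forget samples rises after the ascent step, which is the intended effect of forgetting. A recovery phase would then run ordinary gradient descent on embeddings of the retained samples to restore their accuracy.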

Hanlin Gu, Hong Xi Tae, Lixin Fan, Chee Seng Chan • 2024

Related benchmarks

Task	Dataset	Result
Image Classification	CIFAR-100 (test)	--	3518
Image Classification	CIFAR-10 (test)	--	3381
Image Classification	Tiny ImageNet (test)	Accuracy: 57.84	722
Class Unlearning	CIFAR-10 (test)	Test Accuracy: 67.45	42
Single-class Unlearning	CIFAR-10	Retain Accuracy: 24.1	42
Single-class Unlearning	MNIST	Accuracy Retention (ACCr): 14.9	36
Binary Classification	Income (test)	Test Accuracy: 79.36	34
Class Unlearning	CIFAR-100 (test)	--	22
Class Unlearning	Tiny ImageNet (test)	--	19
Image Classification	MedMNIST PathMNIST (test)	Accuracy: 83.85	12

Showing 10 of 35 rows
