
Eternal Sunshine of the Spotless Net: Selective Forgetting in Deep Networks

About

We explore the problem of selectively forgetting a particular subset of the data used for training a deep neural network. While the effects of the data to be forgotten can be hidden from the output of the network, insights may still be gleaned by probing deep into its weights. We propose a method for "scrubbing" the weights clean of information about a particular set of training data. The method does not require retraining from scratch, nor access to the data originally used for training. Instead, the weights are modified so that any probing function of the weights is indistinguishable from the same function applied to the weights of a network trained without the data to be forgotten. This condition is a generalized and weaker form of Differential Privacy. Exploiting ideas related to the stability of stochastic gradient descent, we introduce an upper bound on the amount of information remaining in the weights, which can be estimated efficiently even for deep neural networks.
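As a rough illustration of the kind of weight scrubbing the abstract describes, the sketch below perturbs a trained network's parameters with anisotropic Gaussian noise scaled by the inverse diagonal Fisher Information computed on the data to be retained, so that directions the retained data barely constrains (where information about the forgotten samples could hide) are noised most heavily. This is only a minimal sketch under our own assumptions: the helper names (`fisher_diagonal`, `scrub_weights`), the diagonal Fisher approximation, and the fixed noise scale `alpha` are illustrative choices, not the paper's exact scrubbing procedure or noise schedule.

```python
# Illustrative sketch (not the paper's exact procedure): scrub a trained model's
# weights by adding Gaussian noise whose scale grows where the retained data's
# (diagonal) Fisher Information is small, erasing information that is specific
# to the forgotten samples while limiting damage on the data kept for training.
import torch
import torch.nn.functional as F


def fisher_diagonal(model, retain_loader, device="cpu"):
    """Approximate diagonal Fisher on the retained data.

    Uses squared mini-batch gradients of the cross-entropy loss as a coarse
    batch-level stand-in for the per-example Fisher Information.
    """
    fisher = {n: torch.zeros_like(p) for n, p in model.named_parameters()}
    model.eval()
    for x, y in retain_loader:
        x, y = x.to(device), y.to(device)
        model.zero_grad()
        loss = F.cross_entropy(model(x), y)
        loss.backward()
        for n, p in model.named_parameters():
            if p.grad is not None:
                fisher[n] += p.grad.detach() ** 2
    n_batches = max(len(retain_loader), 1)
    return {n: f / n_batches for n, f in fisher.items()}


@torch.no_grad()
def scrub_weights(model, fisher, alpha=1e-3, eps=1e-8):
    """Add noise with std proportional to the inverse fourth root of the Fisher.

    `alpha` trades forgetting strength against accuracy on the retained data;
    it is a placeholder for a principled, bound-driven choice.
    """
    for n, p in model.named_parameters():
        std = alpha / (fisher[n] + eps) ** 0.25
        p.add_(torch.randn_like(p) * std)
```

In use, one would train the model on the full data, build a loader over only the retained samples, call `fisher_diagonal`, and then `scrub_weights` on the trained model; the fixed `alpha` here is purely illustrative, whereas the paper ties the strength of forgetting to its information-theoretic bound rather than a hand-picked constant.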

Aditya Golatkar, Alessandro Achille, Stefano Soatto • 2019

Related benchmarks

Task | Dataset | Metric | Result | Rank
Image Classification | CIFAR-10 | Accuracy | 97.16 | 507
Semantic Segmentation | Pascal VOC (test) | -- | -- | 236
Machine Unlearning | CIFAR-10 | Accf | 3.68 | 45
Machine Unlearning | Tiny-ImageNet (train) | Removal Accuracy (Train) | 99.98 | 41
Class Unlearning | CIFAR-10 | Retain Accuracy | 85.57 | 39
Selective Unlearning | Lacuna 10 (test) | Test Error (mean) | 1.53 | 36
Resolving Confusion | CIFAR-10 | Test Error | 17.66 | 28
Machine Unlearning | Tiny-ImageNet Swin-T (test) | Residual Accuracy | 74.37 | 28
Single-class Unlearning | CIFAR-100 | ACCr | 70.97 | 28
Single-class Unlearning | MNIST | Accuracy Retention (ACCr) | 0.9933 | 28
Showing 10 of 97 rows