Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Eternal Sunshine of the Spotless Net: Selective Forgetting in Deep Networks

About

We explore the problem of selectively forgetting a particular subset of the data used for training a deep neural network. While the effects of the data to be forgotten can be hidden from the output of the network, insights may still be gleaned by probing deep into its weights. We propose a method for "scrubbing'" the weights clean of information about a particular set of training data. The method does not require retraining from scratch, nor access to the data originally used for training. Instead, the weights are modified so that any probing function of the weights is indistinguishable from the same function applied to the weights of a network trained without the data to be forgotten. This condition is a generalized and weaker form of Differential Privacy. Exploiting ideas related to the stability of stochastic gradient descent, we introduce an upper-bound on the amount of information remaining in the weights, which can be estimated efficiently even for deep neural networks.

Aditya Golatkar, Alessandro Achille, Stefano Soatto• 2019

Related benchmarks

TaskDatasetResultRank
Image ClassificationCIFAR-10
Accuracy97.16
507
Semantic segmentationPascal VOC (test)--
236
Image ClassificationCIFAR-10 (Forget)
Accuracy92.49
63
Class UnlearningCIFAR-10
Retain Accuracy85.57
60
Image ClassificationCIFAR-100 standard (Forget)
Accuracy (CIFAR-100 Forget)87.2
54
Image ClassificationTiny-ImageNet standard (Forget)
Accuracy69.2
54
Image ClassificationTiny-ImageNet standard (Retain)
Accuracy65.48
54
Image ClassificationCIFAR-10 standard (Retain)
Accuracy94.14
54
Image ClassificationCIFAR-100 standard (Retain)
Accuracy73.08
54
Machine UnlearningTiny-ImageNet 1 class (forget)
Retain Accuracy65.48
48
Showing 10 of 164 rows
...

Other info

Follow for update