Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Certified Data Removal from Machine Learning Models

About

Good data stewardship requires removal of data at the request of the data's owner. This raises the question if and how a trained machine-learning model, which implicitly stores information about its training data, should be affected by such a removal request. Is it possible to "remove" data from a machine-learning model? We study this problem by defining certified removal: a very strong theoretical guarantee that a model from which data is removed cannot be distinguished from a model that never observed the data to begin with. We develop a certified-removal mechanism for linear classifiers and empirically study learning settings in which this mechanism is practical.

Chuan Guo, Tom Goldstein, Awni Hannun, Laurens van der Maaten• 2019

Related benchmarks

TaskDatasetResultRank
Membership Inference AttackNYU V2
AUC96.51
90
Semantic segmentationNYU v2 (val)
mIoU74.12
75
Depth EstimationNYU v2 (val)--
65
Machine UnlearningMNIST
Model Accuracy99.08
56
Semantic segmentationNYU v2 (Retained set)
mIoU92.13
37
Multi-task Unlearning InterferenceNYU V2
UIS30.4
34
Depth EstimationNYU v2 (Retained set)
Acc (sigma 1.25)83.32
33
Surface Normal EstimationNYU v2 (Retained set)
A3058.47
33
Surface Normal PredictionNYU Forget set v2 (train)
A30 Error0.4593
30
Surface Normal PredictionNYU v2 (val)
A30 Accuracy51.38
30
Showing 10 of 15 rows

Other info

Follow for update