
Unlearnable Examples: Making Personal Data Unexploitable

About

The volume of "free" data on the internet has been key to the current success of deep learning. However, it also raises privacy concerns about the unauthorized exploitation of personal data for training commercial models. It is thus crucial to develop methods that prevent unauthorized data exploitation. This paper raises the question: can data be made unlearnable for deep learning models? We present a type of error-minimizing noise that can indeed make training examples unlearnable. Error-minimizing noise is intentionally generated to reduce the training error of one or more training examples to near zero, which tricks the model into believing there is "nothing" to learn from these examples. The noise is restricted to be imperceptible to human eyes, and thus does not affect normal data utility. We empirically verify the effectiveness of error-minimizing noise in both sample-wise and class-wise forms. We also demonstrate its flexibility across extensive experimental settings and its practicality in a case study on face recognition. Our work establishes an important first step towards making personal data unexploitable by deep learning models.

Hanxun Huang, Xingjun Ma, Sarah Monazam Erfani, James Bailey, Yisen Wang • 2021
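
To make the idea concrete, here is a minimal PyTorch sketch of the sample-wise variant described in the abstract: the perturbation is optimized to minimize, rather than maximize, the training loss, under an imperceptibility budget. The function name, hyperparameters, and the simple PGD-style inner loop are illustrative assumptions, not the authors' exact implementation (the paper alternates this noise optimization with training a source model).

```python
import torch
import torch.nn.functional as F

def error_minimizing_noise(model, x, y, eps=8/255, alpha=2/255, steps=20):
    """Craft sample-wise error-minimizing noise for a batch (x, y).

    The perturbation delta is updated to *minimize* the training loss
    (note the minus sign below), the opposite of adversarial noise, so
    the perturbed examples appear "already learned". The L-infinity
    budget eps keeps the noise imperceptible.
    """
    delta = torch.zeros_like(x, requires_grad=True)
    for _ in range(steps):
        loss = F.cross_entropy(model(x + delta), y)
        grad, = torch.autograd.grad(loss, delta)
        # Gradient *descent* on the loss with respect to the noise.
        delta = (delta - alpha * grad.sign()).clamp(-eps, eps)
        # Keep perturbed pixels in the valid [0, 1] range.
        delta = ((x + delta).clamp(0, 1) - x).detach().requires_grad_(True)
    return delta.detach()
```

Data released as x + delta then yields near-zero loss during training, so a model learns little from its actual content.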

Related benchmarks

| Task | Dataset | Result | Rank |
|---|---|---|---|
| Image Classification | CIFAR-100 (test) | – | 3518 |
| Image Classification | CIFAR-10 (test) | Accuracy: 89.15 | 3381 |
| Image Classification | ImageNet (test) | – | 235 |
| Image Classification | SUN397 (test) | Top-1 Accuracy: 38.48 | 136 |
| Image Classification | Flowers (test) | Accuracy: 50.58 | 87 |
| Adversarial Robustness | CIFAR-10 (test) | – | 76 |
| Binary Classification | CH (test) | Accuracy: 76.12 | 64 |
| Image Classification | Cars (test) | Accuracy: 54.43 | 57 |
| Image Classification | Food (test) | Accuracy: 79.43 | 50 |
| Classification | dry-bean (test) | Accuracy: 47.36 | 39 |

Showing 10 of 18 rows.
