EMA: Auditing Data Removal from Trained Models
About
Data auditing is a process to verify whether certain data have been removed from a trained model. A recently proposed method (Liu et al. 20) uses Kolmogorov-Smirnov (KS) distance for such data auditing. However, it fails under certain practical conditions. In this paper, we propose a new method called Ensembled Membership Auditing (EMA) for auditing data removal to overcome these limitations. We compare both methods using benchmark datasets (MNIST and SVHN) and Chest X-ray datasets with multi-layer perceptrons (MLP) and convolutional neural networks (CNN). Our experiments show that EMA is robust under various conditions, including the failure cases of the previously proposed method. Our code is available at: https://github.com/Hazelsuko07/EMA.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Auditing Data Removal | MNIST Memorized (train folds M1-M5) | pks1 | 36 | |
| Auditing Data Removal | Chest X-ray C1 | PKS Score1 | 10 | |
| Auditing Data Removal | Chest X-ray C2 | PKS Score1 | 10 | |
| Auditing Data Removal | Chest X-ray C3 | PKS Score1 | 10 | |
| Auditing Data Removal | Chest X-ray C4 | pks Score1 | 10 | |
| Auditing Data Removal | Chest X-ray C5 | PKS Score1 | 10 | |
| Auditing Data Removal | Chest X-ray C6 | PKS Score0.00e+0 | 10 | |
| Auditing Data Removal | Chest X-ray R | PKS Score0.00e+0 | 10 | |
| Auditing Data Removal | MNIST M6 Disjoint Set | PKS0.00e+0 | 7 | |
| Auditing Data Removal | MNIST Disjoint (M6) | p_EMA0.00e+0 | 6 |