Privacy-Preserving Model Transcription with Differentially Private Synthetic Distillation
About
While many deep learning models trained on private datasets have been deployed in various practical tasks, they may pose a privacy leakage risk as attackers could recover informative data or label knowledge from models. In this work, we present \emph{privacy-preserving model transcription}, a data-free model-to-model conversion solution to facilitate model deployment with a privacy guarantee. To this end, we propose a cooperative-competitive learning approach termed \emph{differentially private synthetic distillation} that learns to convert a pretrained model (teacher) into its privacy-preserving counterpart (student) via a trainable generator without access to private data. The learning collaborates with three players in a unified framework and performs alternate optimization: i)~the generator is learned to generate synthetic data, ii)~the teacher and student accept the synthetic data and compute differential private labels by flexible data or label noisy perturbation, and iii)~the student is updated with noisy labels and the generator is updated by taking the student as a discriminator for adversarial training. We theoretically prove that our approach can guarantee differential privacy and convergence. The transcribed student has good performance and privacy protection, while the resulting generator can generate private synthetic data for downstream tasks. Extensive experiments clearly demonstrate that our approach outperforms 26 state-of-the-arts.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Image Classification | CIFAR10 (test) | Accuracy86.59 | 585 | |
| Image Classification | MNIST (test) | Accuracy (ε=10)97.85 | 14 | |
| Image Classification | CelebA-G (test) | Accuracy (ε=10)91.23 | 12 | |
| Image Classification | MedMNIST (test) | Clean Accuracy86.01 | 11 | |
| Image Classification | CelebA-H (test) | Accuracy (ε=1)79.21 | 11 | |
| Image Classification | ImageNet (test) | Accuracy39.13 | 10 | |
| Image Classification | COVIDx (test) | Accuracy81.73 | 6 | |
| Image Classification | CIFAR10 (test) | Test Accuracy (%)71.12 | 6 |