From Easy to Hard: Building a Shortcut for Differentially Private Image Synthesis
About
Differentially private (DP) image synthesis aims to generate synthetic images from a sensitive dataset, alleviating the privacy leakage concerns of organizations sharing and utilizing synthetic images. Although previous methods have significantly progressed, especially in training diffusion models on sensitive images with DP Stochastic Gradient Descent (DP-SGD), they still suffer from unsatisfactory performance. In this work, inspired by curriculum learning, we propose a two-stage DP image synthesis framework, where diffusion models learn to generate DP synthetic images from easy to hard. Unlike existing methods that directly use DP-SGD to train diffusion models, we propose an easy stage in the beginning, where diffusion models learn simple features of the sensitive images. To facilitate this easy stage, we propose to use `central images', simply aggregations of random samples of the sensitive dataset. Intuitively, although those central images do not show details, they demonstrate useful characteristics of all images and only incur minimal privacy costs, thus helping early-phase model training. We conduct experiments to present that on the average of four investigated image datasets, the fidelity and utility metrics of our synthetic images are 33.1% and 2.1% better than the state-of-the-art method.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Image Generation | CelebA 64 x 64 (test) | FID89.4 | 203 | |
| Image Generation | CelebA 32x32 (test) | FID24.8 | 17 | |
| Differentially Private Image Synthesis | MNIST | FID3.4 | 16 | |
| Differentially Private Image Synthesis | F-MNIST | FID13.3 | 16 | |
| Differentially Private Image Synthesis | CIFAR-10 | FID95.3 | 16 | |
| Differentially Private Image Synthesis | CelebA | FID24.8 | 16 | |
| Differentially Private Image Synthesis | CAMELYON | FID27.8 | 16 | |
| Image Generation | CelebA 128x128 (test) | FID173.6 | 14 |