Optimal Unbiased Randomizers for Regression with Label Differential Privacy
About
We propose a new family of label randomizers for training regression models under the constraint of label differential privacy (DP). In particular, we leverage the trade-offs between bias and variance to construct better label randomizers depending on a privately estimated prior distribution over the labels. We demonstrate that these randomizers achieve state-of-the-art privacy-utility trade-offs on several datasets, highlighting the importance of reducing bias when training neural networks with label DP. We also provide theoretical results shedding light on the structural properties of the optimal unbiased randomizers.
Ashwinkumar Badanidiyuru, Badih Ghazi, Pritish Kamath, Ravi Kumar, Ethan Leeman, Pasin Manurangsi, Avinash V Varadarajan, Chiyuan Zhang• 2023
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Regression | Criteo Sponsored Search Conversion Log (test) | MSE2.80e+3 | 78 | |
| Regression | Communities and Crime 1990 US Census / 1990 US LEMAS / 1995 FBI UCR (test (20%)) | MSE (Mean)0.0194 | 78 | |
| Regression | California Housing Standard (test) | MSE0.611 | 78 |
Showing 3 of 3 rows