Optimal Unbiased Randomizers for Regression with Label Differential Privacy

About

We propose a new family of label randomizers for training regression models under the constraint of label differential privacy (DP). In particular, we leverage the trade-offs between bias and variance to construct better label randomizers depending on a privately estimated prior distribution over the labels. We demonstrate that these randomizers achieve state-of-the-art privacy-utility trade-offs on several datasets, highlighting the importance of reducing bias when training neural networks with label DP. We also provide theoretical results shedding light on the structural properties of the optimal unbiased randomizers.

Ashwinkumar Badanidiyuru, Badih Ghazi, Pritish Kamath, Ravi Kumar, Ethan Leeman, Pasin Manurangsi, Avinash V Varadarajan, Chiyuan Zhang• 2023

Related benchmarks

Task	Dataset	Result
Regression	Criteo Sponsored Search Conversion Log (test)	MSE2.80e+3	78
Regression	Communities and Crime 1990 US Census / 1990 US LEMAS / 1995 FBI UCR (test (20%))	MSE (Mean)0.0194	78
Regression	California Housing Standard (test)	MSE0.611	78

Showing 3 of 3 rows

Other info

Follow for update

@wizwand_team Discord