Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Scaling up the Banded Matrix Factorization Mechanism for Differentially Private ML

About

Correlated noise mechanisms such as DP Matrix Factorization (DP-MF) have proven to be effective alternatives to DP-SGD in large-epsilon few-epoch training regimes. Significant work has been done to find the best correlated noise strategies, and the current state-of-the-art approach is DP-BandMF, which optimally balances the benefits of privacy amplification and noise correlation. Despite it's utility advantages, severe scalability limitations prevent this mechanism from handling large-scale training scenarios where the number of training iterations may exceed $10^4$ and the number of model parameters may exceed $10^7$. In this work, we present techniques to scale up DP-BandMF along these two dimensions, significantly extending it's reach and enabling it to handle settings with virtually any number of model parameters and training iterations, with negligible utility degradation.

Ryan McKenna• 2024

Related benchmarks

TaskDatasetResultRank
Sentiment AnalysisIMDB (test)
Accuracy91.65
306
Matrix Factorization UtilityBalls-in-Bins accountant n=2048, k=8
RMSE4.62
63
Differentially Private Model TrainingDP Noise Correlation Utility
RMSE (eps=8, w/o Amp)7.77
25
Image ClassificationCIFAR-10 (test)
Accuracy Epoch 127.7
10
Sentiment AnalysisIMDB (test)
Accuracy @ Epoch 176.66
10
Sentiment AnalysisIMDb BERT-base (test)
Mean Accuracy (ε=0.5)87.39
8
Image ClassificationCIFAR-10 (test)
Accuracy (eps=0.5)48.58
8
Showing 7 of 7 rows

Other info

Follow for update