Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

A Hassle-free Algorithm for Private Learning in Practice: Don't Use Tree Aggregation, Use BLTs

About

The state-of-the-art for training on-device language models for mobile keyboard applications combines federated learning (FL) with differential privacy (DP) via the DP-Follow-the-Regularized-Leader (DP-FTRL) algorithm. Two variants of DP-FTRL are used in practice, tree aggregation and matrix factorization. However, tree aggregation suffers from significantly suboptimal privacy/utility tradeoffs, while matrix mechanisms require expensive optimization parameterized by hard-to-estimate-in-advance constants, and high runtime memory costs. This paper extends the recently introduced Buffered Linear Toeplitz (BLT) mechanism to multi-participation scenarios. Our BLT-DP-FTRL maintains the ease-of-use advantages of tree aggregation, while essentially matching matrix factorization in terms of utility and privacy. We evaluate BLT-DP-FTRL on the StackOverflow dataset, serving as a re-producible simulation benchmark, and across four on-device language model tasks in a production FL system. Our empirical results highlight the advantages of the BLT mechanism and elevate the practicality and effectiveness of DP in real-world scenarios.

H. Brendan McMahan, Zheng Xu, Yanxiang Zhang• 2024

Related benchmarks

TaskDatasetResultRank
Sentiment AnalysisIMDB (test)
Accuracy91.48
306
Matrix Factorization UtilityBalls-in-Bins accountant n=2048, k=8
RMSE4.87
63
Image ClassificationCIFAR-10 (test)
Accuracy (eps=0.5)51.09
8
Sentiment AnalysisIMDb BERT-base (test)
Mean Accuracy (ε=0.5)88.29
8
Showing 4 of 4 rows

Other info

Follow for update