Fine-Tuning Language Models with Differential Privacy through Adaptive Noise Allocation

About

Language models are capable of memorizing detailed patterns and information, leading to a double-edged effect: they achieve impressive modeling performance on downstream tasks with the stored knowledge but also raise significant privacy concerns. Traditional differential privacy based training approaches offer robust safeguards by employing a uniform noise distribution across all parameters. However, this overlooks the distinct sensitivities and contributions of individual parameters in privacy protection and often results in suboptimal models. To address these limitations, we propose ANADP, a novel algorithm that adaptively allocates additive noise based on the importance of model parameters. We demonstrate that ANADP narrows the performance gap between regular fine-tuning and traditional DP fine-tuning on a series of datasets while maintaining the required privacy constraints.

Xianzhi Li, Ran Zmigrod, Zhiqiang Ma, Xiaomo Liu, Xiaodan Zhu• 2024

Related benchmarks

Task	Dataset	Result	Rank
Instruction Fine-tuning	AI Research Instructions and Outputs	Accuracy (%)85.8		5

Showing 1 of 1 rows

Other info

Follow for update

@wizwand_team Discord