Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Mask-Free Privacy Extraction and Rewriting: A Domain-Aware Approach via Prototype Learning

About

Client-side privacy rewriting is crucial for deploying LLMs in privacy-sensitive domains. However, existing approaches struggle to balance privacy and utility. Full-text methods often distort context, while span-level approaches rely on impractical manual masks or brittle static dictionaries. Attempts to automate localization via prompt-based LLMs prove unreliable, as they suffer from unstable instruction following that leads to privacy leakage and excessive context scrubbing. To address these limitations, we propose DAMPER (Domain-Aware Mask-free Privacy Extraction and Rewriting). DAMPER operationalizes latent privacy semantics into compact Domain Privacy Prototypes via contrastive learning, enabling precise, autonomous span localization. Furthermore, we introduce a Prototype-Guided Preference Alignment, which leverages learned prototypes as semantic anchors to construct preference pairs, optimizing a domain-compliant rewriting policy without human annotations. At inference time, DAMPER integrates a sampling-based Exponential Mechanism to provide rigorous span-level Differential Privacy (DP) guarantees. Extensive experiments demonstrate that DAMPER significantly outperforms existing baselines, achieving a superior privacy-utility trade-off.

Xiaodong Li, Yuhua Wang, Qingchen Yu, Zixuan Qin, Yifan Sun, Qinnan Zhang, Hainan Zhang, Zhiming Zheng• 2026

Related benchmarks

TaskDatasetResultRank
Medical Diagnosis ClassificationPri-DDXPlus (test)
Accuracy79.71
7
Medical Diagnosis ClassificationPri-SLJA (test)
Accuracy83.17
7
Medical Diagnosis ClassificationPri-Mixture (test)
Accuracy80.01
7
Privacy RewritingDDXPlus Pri
Accuracy78.13
7
Privacy RewritingPri-SLJA
Accuracy82.68
7
Privacy RewritingPri-Mixture
Accuracy78.29
7
Showing 6 of 6 rows

Other info

Follow for update