Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Large Language Models are Advanced Anonymizers

About

Recent privacy research on large language models (LLMs) has shown that they achieve near-human-level performance at inferring personal data from online texts. With ever-increasing model capabilities, existing text anonymization methods are currently lacking behind regulatory requirements and adversarial threats. In this work, we take two steps to bridge this gap: First, we present a new setting for evaluating anonymization in the face of adversarial LLM inferences, allowing for a natural measurement of anonymization performance while remedying some of the shortcomings of previous metrics. Then, within this setting, we develop a novel LLM-based adversarial anonymization framework leveraging the strong inferential capabilities of LLMs to inform our anonymization procedure. We conduct a comprehensive experimental evaluation of adversarial anonymization across 13 LLMs on real-world and synthetic online texts, comparing it against multiple baselines and industry-grade anonymizers. Our evaluation shows that adversarial anonymization outperforms current commercial anonymizers both in terms of the resulting utility and privacy. We support our findings with a human study (n=50) highlighting a strong and consistent human preference for LLM-anonymized texts.

Robin Staab, Mark Vero, Mislav Balunovi\'c, Martin Vechev• 2024

Related benchmarks

TaskDatasetResultRank
Attribute InferenceSynthetic dataset
Attribute Inference Accuracy28.76
33
Attribute InferenceReal-world Reddit-derived dataset 1.0 (test)
Attribute Inference Accuracy42.86
30
Text AnonymizationSynthPAI
Privacy64
22
Text AnonymizationDB-Bio
Privacy Score78
17
Text AnonymizationMedQA
Privacy Score24.4
16
Text AnonymizationPUPA
Privacy Score94.2
16
Text AnonymizationTAB
Privacy Score62.1
16
Text AnonymizationPersonalReddit
Privacy Score36.5
14
Text Anonymizationreddit-self-disclosure
Utility Score0.9218
8
Text AnonymizationHuman evaluation
PPP7.5
5
Showing 10 of 12 rows

Other info

Follow for update