PrivAct: Internalizing Contextual Privacy Preservation via Multi-Agent Preference Training
About
Large language model (LLM) agents are increasingly deployed in personalized tasks involving sensitive, context-dependent information, where privacy violations may arise in agents' action due to the implicitness of contextual privacy. Existing approaches rely on external, inference-time interventions which are brittle, scenario-specific, and may expand the privacy attack surface. We propose PrivAct, a contextual privacy-aware multi-agent learning framework that internalizes contextual privacy preservation directly into models' generation behavior for privacy-compliant agentic actions. By embedding privacy preferences into each agent, PrivAct enhances system-wide contextual integrity while achieving a more favorable privacy-helpfulness tradeoff. Experiments across multiple LLM backbones and benchmarks demonstrate consistent improvements in contextual privacy preservation, reducing leakage rates by up to 12.32% while maintaining comparable helpfulness, as well as zero-shot generalization and robustness across diverse multi-agent topologies. Code is available at https://github.com/chengyh23/PrivAct.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Contextual Privacy Preservation | PrivacyLens (test) | Privacy Leak Avg0.18 | 34 | |
| Multi-agent Sequential Communication | AgentLeak | Privacy Score60 | 20 | |
| Privacy and utility evaluation in multi-agent communication | PrivacyLens | Privacy Score81.5 | 15 | |
| Information flow management in multi-party professional scenarios | ConfAIde Tier 4 (test) | MS-E Score80 | 12 | |
| Theory of Mind for privacy control | ConfAIde Tier 3 (test) | FR (E) Score86.037 | 12 | |
| Multi-agent Sequential Communication | MAGPIE | Privacy Score83 | 10 | |
| Communication channel leakage evaluation | AgentLeak (test) | Privacy Score0.551 | 10 | |
| Multi-agent privacy and utility evaluation | AgentLeak Sequential | Privacy56.5 | 10 | |
| Multi-agent privacy and utility evaluation | AgentLeak Hierarchical | Privacy54.5 | 10 | |
| Privacy and Utility Evaluation | PrivacyLens Sequential, 4 agents | Privacy Score83.5 | 5 |