INFA-Guard: Mitigating Malicious Propagation via Infection-Aware Safeguarding in LLM-Based Multi-Agent Systems
About
The rapid advancement of Large Language Model (LLM)-based Multi-Agent Systems (MAS) has introduced significant security vulnerabilities, where malicious influence can propagate virally through inter-agent communication. Conventional safeguards often rely on a binary paradigm that strictly distinguishes between benign and attack agents, failing to account for infected agents i.e., benign entities converted by attack agents. In this paper, we propose Infection-Aware Guard, INFA-Guard, a novel defense framework that explicitly identifies and addresses infected agents as a distinct threat category. By leveraging infection-aware detection and topological constraints, INFA-Guard accurately localizes attack sources and infected ranges. During remediation, INFA-Guard replaces attackers and rehabilitates infected ones, avoiding malicious propagation while preserving topological integrity. Extensive experiments demonstrate that INFA-Guard achieves state-of-the-art performance, reducing the Attack Success Rate (ASR) by an average of 33%, while exhibiting cross-model robustness, superior topological generalization, and high cost-effectiveness.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Prompt Injection | MMLU | ASR@315 | 31 | |
| Targeted Attack | InjecAgent | ASR@33.7 | 31 | |
| Malicious Advice Defense | PoisonRAG | ASR@36.1 | 18 | |
| Prompt Injection Defense | CSQA | ASR@313.4 | 16 | |
| Prompt Injection | MMLU random topology | ASR (k=1)16.3 | 16 | |
| Prompt Injection Defense | GSM8K PI (Prompt Injection) (test) | ASR@13.3 | 16 | |
| Prompt Injection Defense | PI (CSQA) random topology | ASR @124.3 | 16 | |
| Tool Attack Defense | InjecAgent random topology (test) | ASR@10.062 | 16 | |
| Prompt Injection Defense | GSM8K | ASR (Depth 3)3.3 | 3 |