INFA-Guard: Mitigating Malicious Propagation via Infection-Aware Safeguarding in LLM-Based Multi-Agent Systems

About

The rapid advancement of Large Language Model (LLM)-based Multi-Agent Systems (MAS) has introduced significant security vulnerabilities, where malicious influence can propagate virally through inter-agent communication. Conventional safeguards often rely on a binary paradigm that strictly distinguishes between benign and attack agents, failing to account for infected agents i.e., benign entities converted by attack agents. In this paper, we propose Infection-Aware Guard, INFA-Guard, a novel defense framework that explicitly identifies and addresses infected agents as a distinct threat category. By leveraging infection-aware detection and topological constraints, INFA-Guard accurately localizes attack sources and infected ranges. During remediation, INFA-Guard replaces attackers and rehabilitates infected ones, avoiding malicious propagation while preserving topological integrity. Extensive experiments demonstrate that INFA-Guard achieves state-of-the-art performance, reducing the Attack Success Rate (ASR) by an average of 33%, while exhibiting cross-model robustness, superior topological generalization, and high cost-effectiveness.

Yijin Zhou, Xiaoya Lu, Dongrui Liu, Junchi Yan, Jing Shao• 2026

Related benchmarks

Task	Dataset	Result
Prompt Injection	MMLU	ASR@315	91
Targeted Attack	InjecAgent	ASR@33.7	55
Malicious Advice Defense	PoisonRAG	ASR7.3	36
Prompt Injection	MATH	Attack Success Rate (ASR)21.7	36
Trojan Attack	InjecAgent	ASR20.3	36
Prompt Injection	CSQA	ASR29.7	36
Prompt Injection Defense	CSQA	ASR@313.4	16
Prompt Injection	MMLU random topology	ASR (k=1)16.3	16
Prompt Injection Defense	GSM8K PI (Prompt Injection) (test)	ASR@13.3	16
Prompt Injection Defense	PI (CSQA) random topology	ASR @124.3	16

Showing 10 of 12 rows

Other info

Follow for update

@wizwand_team Discord