On the Resilience of LLM-Based Multi-Agent Collaboration with Faulty Agents
About
Large language model-based multi-agent systems have shown great abilities across various tasks due to the collaboration of expert agents, each focusing on a specific domain. However, the impact of clumsy or even malicious agents--those who frequently make errors in their tasks--on the overall performance of the system remains underexplored. This paper investigates: (1) What is the resilience of various system structures (e.g., A$\rightarrow$B$\rightarrow$C, A$\leftrightarrow$B$\leftrightarrow$C) under faulty agents, on different downstream tasks? (2) How can we increase system resilience to defend against these agents? To simulate faulty agents, we propose two approaches--AutoTransform and AutoInject--which introduce mistakes into the agents' responses. Experiments on four downstream tasks using six systems show that the "hierarchical" structure, i.e., A$\rightarrow$(B$\leftrightarrow$C), exhibits superior resilience with the lowest performance drop of 5.5%, compared to 10.5% and 23.7% of other two structures. To further improve resilience, we introduce (1) Challenger, that introduces a mechanism for each agent to challenge others' outputs, and (2) Inspector, an additional agent to review and correct messages, recovering up to 96.4% errors made by faulty agents. Our code and data are available at https://github.com/CUHK-ARISE/MAS-Resilience.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Prompt Injection | MMLU | ASR@319.2 | 91 | |
| Prompt Injection | CSQA | ASR62 | 36 | |
| Trojan Attack | InjecAgent | ASR45 | 36 | |
| Malicious Advice Defense | PoisonRAG | ASR25 | 36 | |
| Prompt Injection | MATH | Attack Success Rate (ASR)37 | 36 | |
| Commonsense Reasoning | CSQA | Task Success Rate (TSR)64.75 | 30 | |
| General Knowledge Question Answering | MMLU | Task Success Rate (TSR)76.75 | 30 | |
| Logical Inference | LogiQA | Task Success Rate (TSR)49.75 | 30 | |
| Answer Accuracy | Alpaca | BRT Accuracy38.9 | 26 | |
| Answer Accuracy | SamSum | BRT Accuracy38 | 26 |