G-Designer: Architecting Multi-agent Communication Topologies via Graph Neural Networks
About
Recent advancements in large language model (LLM)-based agents have demonstrated that collective intelligence can significantly surpass the capabilities of individual agents, primarily due to well-crafted inter-agent communication topologies. Despite the diverse and high-performing designs available, practitioners often face confusion when selecting the most effective pipeline for their specific task: \textit{Which topology is the best choice for my task, avoiding unnecessary communication token overhead while ensuring high-quality solution?} In response to this dilemma, we introduce G-Designer, an adaptive, efficient, and robust solution for multi-agent deployment, which dynamically designs task-aware, customized communication topologies. Specifically, G-Designer models the multi-agent system as a multi-agent network, leveraging a variational graph auto-encoder to encode both the nodes (agents) and a task-specific virtual node, and decodes a task-adaptive and high-performing communication topology. Extensive experiments on six benchmarks showcase that G-Designer is: \textbf{(1) high-performing}, achieving superior results on MMLU with accuracy at $84.50\%$ and on HumanEval with pass@1 at $89.90\%$; \textbf{(2) task-adaptive}, architecting communication protocols tailored to task difficulty, reducing token consumption by up to $95.33\%$ on HumanEval; and \textbf{(3) adversarially robust}, defending against agent adversarial attacks with merely $0.3\%$ accuracy drop.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Mathematical Reasoning | GSM8K | Accuracy93.97 | 983 | |
| Code Generation | HumanEval | Pass@193.54 | 850 | |
| Multi-task Language Understanding | MMLU | Accuracy87.2 | 842 | |
| Language Understanding | MMLU | Accuracy86.92 | 756 | |
| Code Generation | HumanEval (test) | Pass@195.6 | 444 | |
| Mathematical Reasoning | SVAMP | Accuracy90.29 | 368 | |
| Mathematical Reasoning | GSM8K | Accuracy87.23 | 351 | |
| Mathematical Reasoning | AIME | AIME Accuracy78.62 | 283 | |
| Code Generation | MBPP (test) | Pass@190.9 | 276 | |
| Arithmetic Reasoning | MultiArith | Accuracy97.6 | 181 |