G-Designer: Architecting Multi-agent Communication Topologies via Graph Neural Networks

About

Recent advancements in large language model (LLM)-based agents have demonstrated that collective intelligence can significantly surpass the capabilities of individual agents, primarily due to well-crafted inter-agent communication topologies. Despite the diverse and high-performing designs available, practitioners often struggle to select the most effective pipeline for their specific task: "Which topology is the best choice for my task, avoiding unnecessary communication token overhead while ensuring a high-quality solution?" In response to this dilemma, we introduce G-Designer, an adaptive, efficient, and robust solution for multi-agent deployment, which dynamically designs task-aware, customized communication topologies. Specifically, G-Designer models the multi-agent system as a multi-agent network, leveraging a variational graph auto-encoder to encode both the nodes (agents) and a task-specific virtual node, and decodes a task-adaptive and high-performing communication topology. Extensive experiments on six benchmarks show that G-Designer is: (1) high-performing, achieving superior results on MMLU with accuracy at 84.50% and on HumanEval with pass@1 at 89.90%; (2) task-adaptive, architecting communication protocols tailored to task difficulty, reducing token consumption by up to 95.33% on HumanEval; and (3) adversarially robust, defending against agent adversarial attacks with an accuracy drop of merely 0.3%.

Guibin Zhang, Yanwei Yue, Xiangguo Sun, Guancheng Wan, Miao Yu, Junfeng Fang, Kun Wang, Tianlong Chen, Dawei Cheng • 2024
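
The encode-then-decode idea described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: it assumes a single GCN-style layer as the variational encoder, an inner-product decoder, a fixed threshold for edge selection, and untrained random weights. The names `normalize_adjacency`, `design_topology`, `anchor_adj`, `w_mu`, and `w_logvar` are hypothetical and chosen only for illustration.

```python
import numpy as np


def normalize_adjacency(adj):
    """Symmetrically normalize A + I, as in a standard GCN layer."""
    a_hat = adj + np.eye(adj.shape[0])
    deg = a_hat.sum(axis=1)  # >= 1 because of the added self-loops
    d_inv_sqrt = 1.0 / np.sqrt(deg)
    return a_hat * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]


def design_topology(agent_feats, task_feat, anchor_adj, w_mu, w_logvar,
                    rng, threshold=0.5):
    """Sketch: encode agents plus a task virtual node, decode agent-agent edges.

    Assumptions (not taken from the paper): one linear GCN layer per
    variational parameter, inner-product decoding, hard thresholding.
    """
    n_agents = agent_feats.shape[0]

    # Stack agent features with the task-specific virtual node.
    x = np.vstack([agent_feats, task_feat[None, :]])

    # Extend the starting topology so the virtual node touches every agent.
    adj = np.zeros((n_agents + 1, n_agents + 1))
    adj[:n_agents, :n_agents] = anchor_adj
    adj[n_agents, :n_agents] = 1.0
    adj[:n_agents, n_agents] = 1.0
    a_norm = normalize_adjacency(adj)

    # Variational encoder: mean and log-variance of the latent node codes.
    mu = a_norm @ x @ w_mu
    logvar = a_norm @ x @ w_logvar

    # Reparameterization trick: z = mu + sigma * eps.
    z = mu + np.exp(0.5 * logvar) * rng.standard_normal(mu.shape)

    # Inner-product decoder over agent nodes only (virtual node dropped).
    z_agents = z[:n_agents]
    probs = 1.0 / (1.0 + np.exp(-(z_agents @ z_agents.T)))
    np.fill_diagonal(probs, 0.0)  # no self-communication edges

    edges = (probs > threshold).astype(int)
    return probs, edges


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    n, d, h = 4, 8, 3  # agents, feature size, latent size (illustrative)
    feats = rng.standard_normal((n, d))
    task = rng.standard_normal(d)
    anchor = np.ones((n, n)) - np.eye(n)  # fully connected starting graph
    w_mu = 0.1 * rng.standard_normal((d, h))
    w_logvar = 0.1 * rng.standard_normal((d, h))
    probs, edges = design_topology(feats, task, anchor, w_mu, w_logvar, rng)
    print(edges)
```

Because the task embedding enters through the virtual node, a different task vector changes every agent's latent code and therefore the decoded edge set. In a trained system the weights would be learned against task performance and communication cost rather than drawn at random as here.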

Related benchmarks

Task	Dataset	Result
Mathematical Reasoning	GSM8K	Accuracy 93.97	1398
Code Generation	HumanEval	Pass@1 93.54	1043
Multi-task Language Understanding	MMLU	Accuracy 87.2	881
Language Understanding	MMLU	Accuracy 86.92	844
Code Generation	HumanEval (test)	Pass@1 95.6	612
Multitask Language Understanding	MMLU	Accuracy 86.92	520
Mathematical Reasoning	GSM8K	Accuracy 87.23	499
Multi-task Language Understanding	MMLU	MMLU Accuracy 66.67	442
Code Generation	MBPP (test)	Pass@1 90.9	405
Mathematical Reasoning	SVAMP	Accuracy 93.1	403

Showing 10 of 113 rows
