Assemble Your Crew: Automatic Multi-agent Communication Topology Design via Autoregressive Graph Generation
About
Multi-agent systems (MAS) based on large language models (LLMs) have emerged as a powerful solution for dealing with complex problems across diverse domains. The effectiveness of MAS is critically dependent on its collaboration topology, which has become a focal point for automated design research. However, existing approaches are fundamentally constrained by their reliance on a template graph modification paradigm with a predefined set of agents and hard-coded interaction structures, significantly limiting their adaptability to task-specific requirements. To address these limitations, we reframe MAS design as a conditional autoregressive graph generation task, where both the system composition and structure are designed jointly. We propose ARG-Designer, a novel autoregressive model that operationalizes this paradigm by constructing the collaboration graph from scratch. Conditioned on a natural language task query, ARG-Designer sequentially and dynamically determines the required number of agents, selects their appropriate roles from an extensible pool, and establishes the optimal communication links between them. This generative approach creates a customized topology in a flexible and extensible manner, precisely tailored to the unique demands of different tasks. Extensive experiments across six diverse benchmarks demonstrate that ARG-Designer not only achieves state-of-the-art performance but also enjoys significantly greater token efficiency and enhanced extensibility. The source code of ARG-Designer is available at https://github.com/Shiy-Li/ARG-Designer.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Code Generation | HumanEval | Pass@188.25 | 850 | |
| Arithmetic Reasoning | MultiArith | Accuracy97.35 | 181 | |
| Multi-hop Question Answering | HotpotQA | Avg@8 Accuracy86.45 | 32 | |
| Multiple-choice Question Answering | AQUA | Accuracy79.48 | 31 | |
| Code Generation | DS-1000 | Pass@155.6 | 28 | |
| Medical Question Answering | DDXPlus | Accuracy76.03 | 28 | |
| Knowledge Reasoning | MMLU | MMLU Knowledge Reasoning Accuracy85.04 | 19 |