Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Assemble Your Crew: Automatic Multi-agent Communication Topology Design via Autoregressive Graph Generation

About

Multi-agent systems (MAS) based on large language models (LLMs) have emerged as a powerful solution for dealing with complex problems across diverse domains. The effectiveness of MAS is critically dependent on its collaboration topology, which has become a focal point for automated design research. However, existing approaches are fundamentally constrained by their reliance on a template graph modification paradigm with a predefined set of agents and hard-coded interaction structures, significantly limiting their adaptability to task-specific requirements. To address these limitations, we reframe MAS design as a conditional autoregressive graph generation task, where both the system composition and structure are designed jointly. We propose ARG-Designer, a novel autoregressive model that operationalizes this paradigm by constructing the collaboration graph from scratch. Conditioned on a natural language task query, ARG-Designer sequentially and dynamically determines the required number of agents, selects their appropriate roles from an extensible pool, and establishes the optimal communication links between them. This generative approach creates a customized topology in a flexible and extensible manner, precisely tailored to the unique demands of different tasks. Extensive experiments across six diverse benchmarks demonstrate that ARG-Designer not only achieves state-of-the-art performance but also enjoys significantly greater token efficiency and enhanced extensibility. The source code of ARG-Designer is available at https://github.com/Shiy-Li/ARG-Designer.

Shiyuan Li, Yixin Liu, Qingsong Wen, Chengqi Zhang, Shirui Pan• 2025

Related benchmarks

TaskDatasetResultRank
Code GenerationHumanEval
Pass@188.25
1036
Multitask Language UnderstandingMMLU
Accuracy89.54
413
Mathematical ReasoningSVAMP
Accuracy95.63
403
Arithmetic ReasoningMultiArith
Accuracy98.93
229
Math ReasoningAQUA
Accuracy85.51
78
Knowledge ReasoningMMLU
MMLU Knowledge Reasoning Accuracy85.04
65
Medical Question AnsweringDDXPlus
Accuracy76.03
43
Multi-hop Question AnsweringHotpotQA
Avg@8 Accuracy86.45
32
Multiple-choice Question AnsweringAQUA
Accuracy79.48
31
Code GenerationDS-1000
Pass@155.6
28
Showing 10 of 10 rows

Other info

Follow for update