Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

MasRouter: Learning to Route LLMs for Multi-Agent Systems

About

Multi-agent systems (MAS) powered by Large Language Models (LLMs) have been demonstrated to push the boundaries of LLM capabilities, yet they often incur significant costs and face challenges in dynamic LLM selection. Current LLM routing methods effectively reduce overhead in single-agent scenarios by customizing LLM selection for each query, but they overlook the critical decisions regarding collaboration modes and agent roles in MAS. In response to this challenge, we first introduce the problem of Multi-Agent System Routing (MASR), which integrates all components of MAS into a unified routing framework. Toward this goal, we propose MasRouter, the first high-performing, cost-effective, and inductive MASR solution. MasRouter employs collaboration mode determination, role allocation, and LLM routing through a cascaded controller network, progressively constructing a MAS that balances effectiveness and efficiency. Extensive experiments demonstrate that MasRouter is (1) high-performing, achieving a $1.8\%\sim8.2\%$ improvement over the state-of-the-art method on MBPP; (2) economical, reducing overhead by up to $52.07\%$ compared to SOTA methods on HumanEval; and (3) plug-and-play, seamlessly integrating with mainstream MAS frameworks, reducing overhead by $17.21\%\sim28.17\%$ via customized routing. The code is available at https://github.com/yanweiyue/masrouter.

Yanwei Yue, Guibin Zhang, Boyang Liu, Guancheng Wan, Kun Wang, Dawei Cheng, Yiyan Qi• 2025

Related benchmarks

TaskDatasetResultRank
Code GenerationHumanEval
Pass@190.62
850
Mathematical ReasoningGSM8K (test)
Accuracy95.45
751
Mathematical ReasoningGSM8K
Accuracy92
351
Code GenerationMBPP (test)--
276
Mathematical ReasoningMATH
Accuracy91.11
162
General KnowledgeMMLU (test)
Accuracy84.25
33
Code GenerationHumanEval++
Accuracy98.44
22
Multi-task Language UnderstandingMMLU-Redux
Accuracy88.33
22
Code GenerationMBPP
Pass@184
20
Multi-task Language UnderstandingMMLU
Accuracy84.25
20
Showing 10 of 31 rows

Other info

Follow for update