Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Differentiable Mixture-of-Agents Incentivizes Swarm Intelligence of Large Language Models

About

Recent advances in Large Language Models (LLMs) have catalyzed the development of multi-agent systems (MAS) for complex reasoning tasks. However, existing MAS typically rely on pre-defined or pre-compiled communication topologies, which limits their flexibility and adaptability to dynamic task requirements. In this work, we propose Differentiable Mixture-of-Agents (DMoA), a self-evolving multi-agent framework that enables elastic and adaptive agent collaboration during inference. Instead of statically constructing workflows, DMoA dynamically routes and activates agents at each reasoning step, allowing the system to implicitly simulate diverse communication topologies and adapt to evolving demands. To achieve this, we design a differentiable, context-aware routing mechanism that leverages recurrent structures to incorporate historical and contextual information, producing sparse agent activations in a step-wise manner. Furthermore, we introduce predictive entropy as self-supervised signals to optimize the routing process, enabling efficient test-time adaptation without external annotations. Extensive experiments across 9 benchmarks demonstrate that DMoA achieves state-of-the-art performance while exhibiting strong efficiency, robustness, and ensembling capabilities.

Xingjian Wu, Junkai Lu, Siyu Yan, Xiangfei Qiu, Jilin Hu, Chenjuan Guo, Bin Yang• 2026

Related benchmarks

TaskDatasetResultRank
Math ReasoningGSM8K
Accuracy98.87
254
Math ReasoningAQUA
Accuracy86.6
188
CodingHumanEval
Accuracy95.62
60
ReasoningMMLU
Accuracy91.35
54
Question AnsweringHotpotQA
Accuracy90.38
37
Medical ReasoningDDXPlus
Accuracy (DDXPlus)83.37
36
Coding AbilityDS-1000
Accuracy64.34
19
Mathematical ReasoningMultiArith
Accuracy99.15
19
Showing 8 of 8 rows

Other info

Follow for update