Differentiable Mixture-of-Agents Incentivizes Swarm Intelligence of Large Language Models

About

Recent advances in Large Language Models (LLMs) have catalyzed the development of multi-agent systems (MAS) for complex reasoning tasks. However, existing MAS typically rely on pre-defined or pre-compiled communication topologies, which limits their flexibility and adaptability to dynamic task requirements. In this work, we propose Differentiable Mixture-of-Agents (DMoA), a self-evolving multi-agent framework that enables elastic and adaptive agent collaboration during inference. Instead of statically constructing workflows, DMoA dynamically routes and activates agents at each reasoning step, allowing the system to implicitly simulate diverse communication topologies and adapt to evolving demands. To achieve this, we design a differentiable, context-aware routing mechanism that leverages recurrent structures to incorporate historical and contextual information, producing sparse agent activations in a step-wise manner. Furthermore, we introduce predictive entropy as self-supervised signals to optimize the routing process, enabling efficient test-time adaptation without external annotations. Extensive experiments across 9 benchmarks demonstrate that DMoA achieves state-of-the-art performance while exhibiting strong efficiency, robustness, and ensembling capabilities.

Xingjian Wu, Junkai Lu, Siyu Yan, Xiangfei Qiu, Jilin Hu, Chenjuan Guo, Bin Yang• 2026

Related benchmarks

Task	Dataset	Result
Math Reasoning	GSM8K	Accuracy98.87	254
Math Reasoning	AQUA	Accuracy86.6	194
Coding	HumanEval	Accuracy95.62	84
Reasoning	MMLU	Accuracy91.35	57
Question Answering	HotpotQA	Accuracy90.38	50
Medical Reasoning	DDXPlus	Accuracy (DDXPlus)83.37	36
Coding Ability	DS-1000	Accuracy64.34	19
Mathematical Reasoning	MultiArith	Accuracy99.15	19

Showing 8 of 8 rows

Other info

Follow for update

@wizwand_team Discord