Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Agent Primitives: Reusable Latent Building Blocks for Multi-Agent Systems

About

While existing multi-agent systems (MAS) can handle complex problems by enabling collaboration among multiple agents, they are often highly task-specific, relying on manually crafted agent roles and interaction prompts, which leads to increased architectural complexity and limited reusability across tasks. Moreover, most MAS communicate primarily through natural language, making them vulnerable to error accumulation and instability in long-context, multi-stage interactions within internal agent histories. In this work, we propose \textbf{Agent Primitives}, a set of reusable latent building blocks for LLM-based MAS. Inspired by neural network design, where complex models are built from reusable components, we observe that many existing MAS architectures can be decomposed into a small number of recurring internal computation patterns. Based on this observation, we instantiate three primitives: Review, Voting and Selection, and Planning and Execution. All primitives communicate internally via key-value (KV) cache, which improves both robustness and efficiency by mitigating information degradation across multi-stage interactions. To enable automatic system construction, an Organizer agent selects and composes primitives for each query, guided by a lightweight knowledge pool of previously successful configurations, forming a primitive-based MAS. Experiments show that primitives-based MAS improve average accuracy by 12.0-16.5\% over single-agent baselines, reduce token usage and inference latency by approximately 3$\times$-4$\times$ compared to text-based MAS, while incurring only 1.3$\times$-1.6$\times$ overhead relative to single-agent inference and providing more stable performance across model backbones.

Haibo Jin, Kuang Peng, Ye Yu, Xiaopeng Yuan, Haohan Wang• 2026

Related benchmarks

TaskDatasetResultRank
Mathematical ReasoningMATH
Accuracy72.4
643
Mathematical ReasoningGSM8K
Accuracy (GSM8K)93.8
358
Question AnsweringGPQA
Accuracy53.2
258
Code GenerationHumanEval+--
189
Mathematical Problem SolvingMATH
Accuracy79.8
166
Medical Question AnsweringMedQA
Accuracy82.7
109
Math Word Problem SolvingGSM8K
Accuracy95.6
91
Code GenerationMBPP+
Accuracy75.9
75
Question AnsweringGPQA Diamond
Accuracy66.7
62
Mathematical Problem SolvingAIME 25
Accuracy73.3
54
Showing 10 of 16 rows

Other info

Follow for update