Agent Primitives: Reusable Latent Building Blocks for Multi-Agent Systems

About

While existing multi-agent systems (MAS) can handle complex problems by enabling collaboration among multiple agents, they are often highly task-specific, relying on manually crafted agent roles and interaction prompts, which leads to increased architectural complexity and limited reusability across tasks. Moreover, most MAS communicate primarily through natural language, making them vulnerable to error accumulation and instability in long-context, multi-stage interactions within internal agent histories. In this work, we propose Agent Primitives, a set of reusable latent building blocks for LLM-based MAS. Inspired by neural network design, where complex models are built from reusable components, we observe that many existing MAS architectures can be decomposed into a small number of recurring internal computation patterns. Based on this observation, we instantiate three primitives: Review, Voting and Selection, and Planning and Execution. All primitives communicate internally via key-value (KV) cache, which improves both robustness and efficiency by mitigating information degradation across multi-stage interactions. To enable automatic system construction, an Organizer agent selects and composes primitives for each query, guided by a lightweight knowledge pool of previously successful configurations, forming a primitive-based MAS. Experiments show that primitives-based MAS improve average accuracy by 12.0-16.5% over single-agent baselines, reduce token usage and inference latency by approximately 3×-4× compared to text-based MAS, while incurring only 1.3×-1.6× overhead relative to single-agent inference and providing more stable performance across model backbones.
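The composition step described in the abstract can be pictured with a small toy sketch. It is not the authors' implementation: the names `LatentState`, `KNOWLEDGE_POOL`, `organize`, and `run` are hypothetical, the keyword lookup stands in for whatever selection logic the Organizer agent actually uses, the pool entries are made up for illustration, and a plain list of labels stands in for the KV cache passed between primitives.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class LatentState:
    """Hypothetical stand-in for the shared latent state (a KV cache in the paper)."""
    trace: List[str] = field(default_factory=list)


Primitive = Callable[[LatentState], LatentState]


def review(state: LatentState) -> LatentState:
    # Placeholder: a real primitive would run agents over the shared cache.
    state.trace.append("review")
    return state


def voting_and_selection(state: LatentState) -> LatentState:
    state.trace.append("voting_and_selection")
    return state


def planning_and_execution(state: LatentState) -> LatentState:
    state.trace.append("planning_and_execution")
    return state


# The three primitives named in the abstract, keyed by an illustrative name.
PRIMITIVES: Dict[str, Primitive] = {
    "review": review,
    "voting_and_selection": voting_and_selection,
    "planning_and_execution": planning_and_execution,
}

# Assumed knowledge pool: task keyword -> a configuration that worked before.
# These entries are invented for the example, not taken from the paper.
KNOWLEDGE_POOL: Dict[str, List[str]] = {
    "math": ["planning_and_execution", "review"],
    "code": ["planning_and_execution", "voting_and_selection"],
}
DEFAULT_CONFIG: List[str] = ["review"]


def organize(query: str) -> List[str]:
    """Toy Organizer: return the first pooled configuration whose keyword matches."""
    lowered = query.lower()
    for keyword, config in KNOWLEDGE_POOL.items():
        if keyword in lowered:
            return list(config)
    return list(DEFAULT_CONFIG)


def run(query: str) -> LatentState:
    """Apply the selected primitives in order, threading one shared state through them."""
    state = LatentState()
    for name in organize(query):
        state = PRIMITIVES[name](state)
    return state
```

Calling `run("Solve this math problem").trace` lists the primitives applied, in order. The point of the sketch is only the shape of the design: a fixed set of reusable blocks, a per-query selection step, and a single state object handed from block to block instead of natural-language messages.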

Haibo Jin, Peng Kuang, Ye Yu, Xiaopeng Yuan, Haohan Wang • 2026

Related benchmarks

Task	Dataset	Result
Mathematical Reasoning	MATH	Accuracy 72.4	882
Code Generation	HumanEval+	--	393
Mathematical Reasoning	GSM8K	Accuracy (GSM8K) 93.8	358
Question Answering	GPQA	Accuracy 53.2	258
Code Generation	MBPP+	Accuracy 75.9	236
Mathematical Problem Solving	MATH	Accuracy 79.8	229
Medical Question Answering	MedQA	Accuracy 82.7	153
Math Word Problem Solving	GSM8K	Accuracy 95.6	111
Question Answering	GPQA Diamond	Accuracy 66.7	97
Mathematical Problem Solving	AIME 25	Accuracy 73.3	71

Showing 10 of 16 rows
