MAS-on-the-Fly: Dynamic Adaptation of LLM-based Multi-Agent Systems at Test Time

About

Large Language Model (LLM)-based multi-agent systems (MAS) have emerged as a promising paradigm for solving complex tasks. However, existing works often rely on manual designs or "one-size-fits-all" automation, lacking dynamic adaptability after deployment. Inspired by how biological systems adapt, we introduce MASFly, a novel multi-agent framework enabling dynamic adaptation at test time. To adapt system generation, MASFly employs a retrieval-augmented SOP instantiation mechanism that leverages a self-constructed repository of successful collaboration patterns, enabling the LLM to assemble customized MASs for new queries. For adaptive execution, MASFly incorporates an experience-guided supervision mechanism, where a dedicated Watcher agent monitors system behaviors with reference to a personalized experience pool and provides real-time interventions. Extensive experiments demonstrate that MASFly achieves state-of-the-art performance, most notably a 61.7% success rate on the TravelPlanner benchmark, while exhibiting strong task adaptability and robustness.
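The two mechanisms named in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the `SOP` and `Experience` records, the bag-of-words similarity, and the substring-based `watch` check are all assumptions made for illustration. SOP is read here as a stored collaboration pattern; a real system would presumably use an embedding model for retrieval and an LLM-backed Watcher.

```python
import math
import re
from collections import Counter
from dataclasses import dataclass
from typing import List, Optional


def tokens(text: str) -> Counter:
    """Lowercased bag of words; a stand-in for a real embedding model."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two bag-of-words vectors."""
    dot = sum(a[w] * b[w] for w in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


@dataclass
class SOP:
    """Hypothetical record of a collaboration pattern that succeeded before."""
    task: str
    roles: List[str]


@dataclass
class Experience:
    """Hypothetical experience-pool entry: a failure trigger and advice."""
    trigger: str
    advice: str


def retrieve_sop(query: str, repository: List[SOP]) -> SOP:
    """Return the stored SOP whose task description best matches the query."""
    q = tokens(query)
    return max(repository, key=lambda sop: cosine(q, tokens(sop.task)))


def watch(message: str, pool: List[Experience]) -> Optional[str]:
    """Return advice if an agent message contains a known failure trigger."""
    lowered = message.lower()
    for exp in pool:
        if exp.trigger in lowered:
            return exp.advice
    return None


# Toy repository and experience pool (made-up entries, not from the paper).
repository = [
    SOP("write and test a python function", ["Coder", "Tester"]),
    SOP("plan a multi-day trip within a budget", ["Planner", "BudgetChecker"]),
]
pool = [Experience("over budget", "Re-plan with cheaper accommodation.")]

sop = retrieve_sop("Plan a 3-day trip to Boston within a $900 budget", repository)
print(sop.roles)
print(watch("Draft itinerary is over budget by $120", pool))
```

In this sketch the retrieved SOP only supplies a list of role names. Turning those roles into running agents, and letting the Watcher intervene mid-execution, is left out.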

Guangyi Liu, Haojun Lin, Huan Zeng, Heng Wang, Quanming Yao • 2026

Related benchmarks

Task	Dataset	Result
Code Generation	HumanEval	Pass@1: 98.73	1043
Long horizon planning	TravelPlanner	Delivery Rate: 98.9	13
Code Generation	MBPP Pro	Pass@1: 85.56	9
General Assistant Reasoning	GAIA	GAIA Level 1 Score: 46.3	9

