Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

CASTER: Breaking the Cost-Performance Barrier in Multi-Agent Orchestration via Context-Aware Strategy for Task Efficient Routing

About

Graph-based Multi-Agent Systems (MAS) enable complex cyclic workflows but suffer from inefficient static model allocation, where deploying strong models uniformly wastes computation on trivial sub-tasks. We propose CASTER (Context-Aware Strategy for Task Efficient Routing), a lightweight router for dynamic model selection in graph-based MAS. CASTER employs a Dual-Signal Router that combines semantic embeddings with structural meta-features to estimate task difficulty. During training, the router self-optimizes through a Cold Start to Iterative Evolution paradigm, learning from its own routing failures via on-policy negative feedback. Experiments using LLM-as-a-Judge evaluation across Software Engineering, Data Analysis, Scientific Discovery, and Cybersecurity demonstrate that CASTER reduces inference cost by up to 72.4% compared to strong-model baselines while matching their success rates, and consistently outperforms both heuristic routing and FrugalGPT across all domains.

Shanyv Liu, Xuyang Yuan, Tao Chen, Zijun Zhan, Zhu Han, Danyang Zheng, Weishan Zhang, Shaohua Cao• 2026

Related benchmarks

TaskDatasetResultRank
Task-Efficient RoutingSoftware scenario
Cost0.0186
32
Task-Efficient RoutingSecurity scenario
Cost0.0049
20
Task-Efficient RoutingData scenario
Cost0.0255
20
Task-Efficient RoutingScience scenario
Cost0.0695
18
Quality ScoringData scenario
Average Score84.6
15
Quality ScoringScience scenario
Avg. Score97.6
15
Quality ScoringSecurity scenario
Average Score96.2
15
Task RoutingData
Cost ($)0.0344
15
Quality ScoringSoftware scenario
Average Score100
15
Task RoutingScience
Cost ($)0.0695
15
Showing 10 of 37 rows

Other info

Follow for update