CASTER: Breaking the Cost-Performance Barrier in Multi-Agent Orchestration via Context-Aware Strategy for Task Efficient Routing
About
Graph-based Multi-Agent Systems (MAS) enable complex cyclic workflows but suffer from inefficient static model allocation, where deploying strong models uniformly wastes computation on trivial sub-tasks. We propose CASTER (Context-Aware Strategy for Task Efficient Routing), a lightweight router for dynamic model selection in graph-based MAS. CASTER employs a Dual-Signal Router that combines semantic embeddings with structural meta-features to estimate task difficulty. During training, the router self-optimizes through a Cold Start to Iterative Evolution paradigm, learning from its own routing failures via on-policy negative feedback. Experiments using LLM-as-a-Judge evaluation across Software Engineering, Data Analysis, Scientific Discovery, and Cybersecurity demonstrate that CASTER reduces inference cost by up to 72.4% compared to strong-model baselines while matching their success rates, and consistently outperforms both heuristic routing and FrugalGPT across all domains.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Task-Efficient Routing | Software scenario | Cost0.0186 | 32 | |
| Task-Efficient Routing | Security scenario | Cost0.0049 | 20 | |
| Task-Efficient Routing | Data scenario | Cost0.0255 | 20 | |
| Task-Efficient Routing | Science scenario | Cost0.0695 | 18 | |
| Quality Scoring | Data scenario | Average Score84.6 | 15 | |
| Quality Scoring | Science scenario | Avg. Score97.6 | 15 | |
| Quality Scoring | Security scenario | Average Score96.2 | 15 | |
| Task Routing | Data | Cost ($)0.0344 | 15 | |
| Quality Scoring | Software scenario | Average Score100 | 15 | |
| Task Routing | Science | Cost ($)0.0695 | 15 |