Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Learning to Orchestrate Agents under Uncertainty

About

Adaptive orchestration of heterogeneous agents requires making sequential delegation decisions under uncertain and evolving agent behaviour, e.g., coordinating specialised AI models with varying reliability, cost, and response quality. While prior work on agent orchestration focuses on performance or cost, uncertainty in agent reliability and output distributions is typically not modelled explicitly at the orchestration level. In this work, we study the problem of adaptive orchestration of heterogeneous agents under uncertainty, where a meta-controller must decide when to delegate to an agent, accounting for reliability, cost, and uncertainty. We propose BOT-Orch, a lightweight framework that recasts orchestration as a bandit problem over agents, regularized by OT distances between agent output distributions and task-specific reference distributions. We show that the regularised orchestration enjoys $\mathcal{O}(\sqrt{T})$ regret under standard assumptions, and provably induces preference ordering among agents with identical mean rewards but differing distributional alignment. Empirically, we demonstrate that BOT-Orch outperforms standard bandit and heuristic baselines in synthetic but adversarial task allocation settings with heterogeneous, non-i.i.d. agent behaviour.

Mary Chriselda Antony Oliver, Lan Jiang, Aaron Bundi Anampiu, Elaf Almahmoud, Francesco Quinzan, Umang Bhatt• 2026

Related benchmarks

TaskDatasetResultRank
Agent OrchestrationIID-G Synthetic Environment (test)
Event Rate0.63
4
Agent OrchestrationIID-M Synthetic Environment (test)
Event Rate0.65
4
Agent OrchestrationNonIID-BB Synthetic Environment (test)
Event Rate0.67
4
Agent OrchestrationNonIID-PS Synthetic Environment (test)
Event Rate0.66
4
Agent OrchestrationNonIID-SD Synthetic Environment (test)
Event Rate0.65
4
Agent-task matchingIID-G
Cumulative Alignment Cost537.4
4
Agent-task matchingIID-M
Cumulative Alignment Cost459.8
4
Agent-task matchingBB NonIID
Cumulative Alignment Cost410
4
Agent-task matchingNonIID-PS
Cumulative Alignment Cost571.4
4
Agent-task matchingNonIID-SD
Cumulative Alignment Cost564.1
4
Showing 10 of 17 rows

Other info

Follow for update