Why Search When You Can Transfer? Amortized Agentic Workflow Design from Structural Priors
About
Automated agentic workflow design currently relies on per-task iterative search, which is computationally prohibitive and fails to reuse structural knowledge across tasks. We observe that optimized workflows converge to a small family of domain-specific topologies, suggesting that this combinatorial search is largely redundant. Building on this insight, we propose SWIFT (Synthesizing Workflows via Few-shot Transfer), a framework that amortizes workflow design into reusable structural priors. SWIFT first distills compositional heuristics and output-interface contracts from contrastive analysis of prior search trajectories across source tasks. At inference time, it conditions a single LLM generation pass on these priors together with cross-task workflow demonstrations to synthesize a complete, executable workflow for an unseen target task, bypassing iterative search entirely. On five benchmarks, SWIFT outperforms the state-of-the-art search-based method while reducing marginal per-task optimization cost by three orders of magnitude. It further generalizes to four additional unseen benchmarks and transfers successfully from GPT-4o-mini to three additional foundation models (Grok, Qwen, Gemma). Controlled ablations reveal that workflow demonstrations primarily transfer topological structure rather than surface semantics: replacing all operator names with random strings still retains over 93% of the full system's average performance.
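The operator-renaming ablation described above can be illustrated with a minimal sketch. It assumes a workflow is represented as a plain dictionary of operator names and directed edges; that representation, the operator names (`Generate`, `Review`, `Revise`, `Ensemble`), and the helper functions `anonymize_operators` and `degree_signature` are all hypothetical and are not taken from the SWIFT implementation. The sketch only shows that renaming operators to random strings leaves the graph topology unchanged.

```python
import random
import string


def anonymize_operators(workflow, seed=0):
    """Replace every operator name with a random lowercase string.

    Edges are rewritten through the same mapping, so the graph structure
    is preserved while the surface names carry no meaning.
    """
    rng = random.Random(seed)
    mapping = {}
    used = set()
    for name in workflow["nodes"]:
        alias = "".join(rng.choices(string.ascii_lowercase, k=8))
        while alias in used:  # re-draw on the (unlikely) collision
            alias = "".join(rng.choices(string.ascii_lowercase, k=8))
        used.add(alias)
        mapping[name] = alias
    anonymized = {
        "nodes": [mapping[n] for n in workflow["nodes"]],
        "edges": [(mapping[a], mapping[b]) for a, b in workflow["edges"]],
    }
    return anonymized, mapping


def degree_signature(workflow):
    """Name-independent summary: sorted (in-degree, out-degree) pairs."""
    indeg = {n: 0 for n in workflow["nodes"]}
    outdeg = {n: 0 for n in workflow["nodes"]}
    for src, dst in workflow["edges"]:
        outdeg[src] += 1
        indeg[dst] += 1
    return sorted((indeg[n], outdeg[n]) for n in workflow["nodes"])


# Hypothetical demonstration workflow (operator names are illustrative only).
demo = {
    "nodes": ["Generate", "Review", "Revise", "Ensemble"],
    "edges": [
        ("Generate", "Review"),
        ("Review", "Revise"),
        ("Generate", "Ensemble"),
        ("Revise", "Ensemble"),
    ],
}

anon, mapping = anonymize_operators(demo)
same_topology = degree_signature(anon) == degree_signature(demo)
```

The degree signature is a deliberately coarse check: it confirms that the renamed workflow has the same shape as the original, but it is not a full graph-isomorphism test.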
Related benchmarks
| Task | Dataset | Metric | Result | Rank |
|---|---|---|---|---|
| Code Generation | HumanEval (test) | -- | -- | 612 |
| Code Generation | MBPP (test) | -- | -- | 405 |
| Math Reasoning | AQUA | Accuracy | 79.9 | 188 |
| Mathematical Reasoning | AIME 24/25 | Accuracy | 14.6 | 171 |
| Arithmetic Reasoning | MultiArith (test) | Accuracy | 98.54 | 115 |
| Code Generation | BigCodeBench | Accuracy | 34.3 | 73 |
| Mathematical Reasoning | MATH (test) | Accuracy | 55.14 | 41 |
| Mathematical Reasoning | MATH (test) | Execution Cost | 1.73 | 14 |