Learning to Construct Practical Agentic Systems

About

Automated design and optimization of agentic LLM-based systems leads to sophisticated systems that substantially improve result quality over off-the-shelf agentic patterns. However, studies of fielded agentic systems show that production systems focus much more on issues such as simplicity, controllability, and predictability of inference costs. In this paper we propose principled approaches to designing and optimizing practical agentic systems. We describe an agent framework that enables designers to enforce modularity in agentic systems, by defining "pseudo-tools" that call LLMs recursively on a restricted context. Using this framework we hand-engineer agents for a diverse set of tasks, and show that relative to dynamically-planned workflows, hand-constructed fixed workflows are generally cheaper and more accurate. We then propose novel learning methods for the agentic components required by this framework, namely pseudo-tools and fixed workflows. These learning methods generally outperform hand-engineered agents. We also exploit the modularity of the framework to apply multi-objective optimization methods to jointly optimize cost and response quality and blend the results of multiple learning systems.
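To make the "pseudo-tool" idea concrete, here is a minimal sketch of how a tool-like wrapper could call an LLM on a restricted slice of the context, and how several such wrappers could be chained into a fixed workflow. This is an illustration under stated assumptions, not the authors' framework: the names `PseudoTool`, `run_fixed_workflow`, `visible_keys`, and `stub_llm` are hypothetical, and the model is represented by any function from a prompt string to a completion string.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List

# Assumption: an "LLM" is any callable mapping a prompt string to a completion.
LLM = Callable[[str], str]


@dataclass
class PseudoTool:
    """Hypothetical tool-like wrapper: a nested LLM call on a restricted context."""
    name: str
    instruction: str
    visible_keys: List[str]  # the only context entries the nested call may see

    def __call__(self, llm: LLM, context: Dict[str, str]) -> str:
        # Restrict the context so that only whitelisted keys reach the prompt.
        restricted = {k: context[k] for k in self.visible_keys if k in context}
        body = "\n".join(f"{k}: {v}" for k, v in restricted.items())
        return llm(f"{self.instruction}\n{body}")


def run_fixed_workflow(
    llm: LLM, steps: List[PseudoTool], context: Dict[str, str]
) -> Dict[str, str]:
    """Run pseudo-tools in a fixed order, storing each output under the tool's name."""
    context = dict(context)  # copy so the caller's dictionary is not mutated
    for tool in steps:
        context[tool.name] = tool(llm, context)
    return context


def stub_llm(prompt: str) -> str:
    # Stand-in for a real model call; reports how many prompt lines it received.
    return f"<saw {len(prompt.splitlines())} prompt lines>"


workflow = [
    PseudoTool("extract", "Extract the relevant facts.", ["document"]),
    PseudoTool("answer", "Answer using the extracted facts.", ["question", "extract"]),
]
result = run_fixed_workflow(
    stub_llm, workflow, {"document": "some text", "question": "some question"}
)
```

Because the order of steps is fixed in advance, the number of model calls (and hence the inference cost) is known before execution, which is the kind of predictability the abstract attributes to fixed workflows. A dynamically planned workflow would instead let the model choose the next tool at each step.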

Aditya Kumar, Zhihan Lei, Jerry Yan, Joshua W. Momo, Lauhitya Reddy, Rafael Enrique Cabrera Jimenez, Cassandra A. Cohen, Arthur Kajiyama, William W. Cohen • 2026

Related benchmarks

| Task | Dataset | Result | Rank |
|---|---|---|---|
| Common Sense Reasoning | BBH Sports Understanding | Accuracy (BBH Sports): 87 | 21 |
| Financial Question Answering | FinQA | Accuracy: 75 | 3 |
| Medical Agentic Reasoning | MedAgentBench | Accuracy: 87 | 3 |
| Medical Rule-following | MedCalc Rules | Accuracy: 50 | 3 |
| Multi-step Reasoning | MUSR Murder | Accuracy: 68 | 3 |
| Multi-step Reasoning | MUSR Objects | Accuracy: 58 | 3 |
| Multi-step Reasoning | MUSR Teams | Accuracy: 61 | 3 |
| Planning | NaturalPlan Calendar | Accuracy: 62 | 3 |
| Planning | NaturalPlan Meeting | Accuracy: 37 | 3 |
| Planning | NaturalPlan Trip | Accuracy: 21 | 3 |
Showing 10 of 19 rows
