Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Profile-Then-Reason: Bounded Semantic Complexity for Tool-Augmented Language Agents

About

Large language model agents that use external tools are often implemented through reactive execution, in which reasoning is repeatedly recomputed after each observation, increasing latency and sensitivity to error propagation. This work introduces Profile--Then--Reason (PTR), a bounded execution framework for structured tool-augmented reasoning, in which a language model first synthesizes an explicit workflow, deterministic or guarded operators execute that workflow, a verifier evaluates the resulting trace, and repair is invoked only when the original workflow is no longer reliable. A mathematical formulation is developed in which the full pipeline is expressed as a composition of profile, routing, execution, verification, repair, and reasoning operators; under bounded repair, the number of language-model calls is restricted to two in the nominal case and three in the worst case. Experiments against a ReAct baseline on six benchmarks and four language models show that PTR achieves the pairwise exact-match advantage in 16 of 24 configurations. The results indicate that PTR is particularly effective on retrieval-centered and decomposition-heavy tasks, whereas reactive execution remains preferable when success depends on substantial online adaptation.

Paulo Akira F. Enabe• 2026

Related benchmarks

TaskDatasetResultRank
Question AnsweringTriviaQA
EM82
182
Question AnsweringNQ-Open
Exact Match (EM)46.9
32
Question AnsweringStrategyQA
Exact Match (EM)82
16
Multi-hop Question AnsweringHotpotQA
Exact Match (HotPotQA)12.2
8
Algebraic ReasoningAQUA-RAT
PTR Advance Rate100
2
Mathematical ReasoningGSM8K
PTR Advancement Accuracy3
2
Multi-hop Question AnsweringHotpotQA
PTR Accuracy1
2
Question AnsweringTriviaQA
PTR Advancement4
2
Question AnsweringNQ-Open
PTR Adversarial Success Count4
2
ReasoningStrategyQA
PTR Advancement3
2
Showing 10 of 10 rows

Other info

Follow for update