Profile-Then-Reason: Bounded Semantic Complexity for Tool-Augmented Language Agents

About

Large language model agents that use external tools are often implemented through reactive execution, in which reasoning is repeatedly recomputed after each observation, increasing latency and sensitivity to error propagation. This work introduces Profile--Then--Reason (PTR), a bounded execution framework for structured tool-augmented reasoning, in which a language model first synthesizes an explicit workflow, deterministic or guarded operators execute that workflow, a verifier evaluates the resulting trace, and repair is invoked only when the original workflow is no longer reliable. A mathematical formulation is developed in which the full pipeline is expressed as a composition of profile, routing, execution, verification, repair, and reasoning operators; under bounded repair, the number of language-model calls is restricted to two in the nominal case and three in the worst case. Experiments against a ReAct baseline on six benchmarks and four language models show that PTR achieves the pairwise exact-match advantage in 16 of 24 configurations. The results indicate that PTR is particularly effective on retrieval-centered and decomposition-heavy tasks, whereas reactive execution remains preferable when success depends on substantial online adaptation.

Paulo Akira F. Enabe• 2026

Related benchmarks

Task	Dataset	Result
Question Answering	TriviaQA	EM82	182
Question Answering	NQ-Open	Exact Match (EM)46.9	32
Question Answering	StrategyQA	Exact Match (EM)82	16
Multi-hop Question Answering	HotpotQA	Exact Match (HotPotQA)12.2	8
Algebraic Reasoning	AQUA-RAT	PTR Advance Rate100	2
Mathematical Reasoning	GSM8K	PTR Advancement Accuracy3	2
Multi-hop Question Answering	HotpotQA	PTR Accuracy1	2
Question Answering	TriviaQA	PTR Advancement4	2
Question Answering	NQ-Open	PTR Adversarial Success Count4	2
Reasoning	StrategyQA	PTR Advancement3	2

Showing 10 of 10 rows

Other info

Follow for update

@wizwand_team Discord