Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Agent JIT Compilation for Latency-Optimizing Web Agent Planning and Scheduling

About

Computer-use agents (CUAs) automate tasks specified with natural language such as "order the cheapest item from Taco Bell" by generating sequences of calls to tools such as click, type, and scroll on a browser. Current implementations follow a sequential fetch-screenshot-execute loop where each iteration requires an LLM call, resulting in high latency and frequent errors from incorrect tool use. We present agent just-in-time (JIT) compilation, a system that compiles task descriptions directly into executable code that may include LLM calls, tool calls, and parallelization. Our approach comprises three components: (1) JIT-Planner, which generates multiple code plans, validates each against tool specifications, and selects the minimum-cost candidate; (2) JIT-Scheduler, which explores parallelization strategies via Monte Carlo cost estimation from learned latency distributions; and (3) an invariant-enforcing tool protocol specifying precondition and postcondition requirements to reduce the rate of incorrect tool use. Across five applications, JIT-Planner achieves $10.4\times$ speedup and 28$\%$ higher accuracy over Browser-Use, while JIT-Scheduler achieves $2.4\times$ speedup and 9\% higher accuracy over OpenAI CUA.

Caleb Winston, Ron Yifeng Wang, Azalia Mirhoseini, Christos Kozyrakis• 2026

Related benchmarks

TaskDatasetResultRank
Web Agent AutomationDashdish
End-to-end Latency (s)14.5
5
Web Agent AutomationGitLab
End-to-end Latency (seconds)9.9
5
Web Agent AutomationGomail
End-to-end Latency (s)10.4
5
Web Agent AutomationOmnizon
End-to-end Latency (s)15.5
5
Web Agent AutomationREDDIT
End-to-end Latency (s)4.3
5
Web Agent AutomationWeb Applications Overall
End-to-end Latency (s)10.9
5
Web Task CompletionGitLab
Accuracy87
5
Web Task CompletionGomail
Accuracy96
5
Web Task CompletionOmnizon
Accuracy89
5
Web Task CompletionREDDIT
Accuracy93
5
Showing 10 of 12 rows

Other info

Follow for update