Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Agent Execution on EnterpriseBench (test)

55Execution Accuracy

Claude-3.5-Sonnet

34.239.64550.4Mar 23, 2026
Updated 25d ago

Evaluation Results

MethodLinks
2026.03
55
2026.03
55
2026.03
51
2026.03
47
2026.03
41
2026.03
40
2026.03
38
2026.03
35