Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Enterprise interface interaction on WorkArena L2 full benchmark

69.4Success Rate

GPT-5

9.18424.81740.4556.083Apr 9, 2026
Updated 9d ago

Evaluation Results

MethodLinks
2026.04
69.4
2026.04
40.4
2026.04
11.5