
Agentic Web Interaction on xbench DeepSearch 2510 (test)

66 Pass@1

GPT-5

[Chart: Pass@1 scores; date axis tick: Apr 4, 2026]
Updated 12d ago

Evaluation Results

Method    Links
2026.04    66      -
2026.04    51      -
2026.04    47      -
2026.04    44      -
2026.04    44      60
2026.04    38.3    53
2026.04    35      -
2026.04    30      -
2026.04    27      -
2026.04    8.7     16