Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Agentic Reasoning on HLE

41.6Overall Score

ChatGPT-Agent

13.769620.994828.2235.4452Feb 9, 2026
Updated 4d ago

Evaluation Results

MethodLinks
41.6-
2026.02
4040.87
2026.02
34.5236.1
26.9-
2026.02
26.9-
2026.02
26.6-
2026.02
14.8415.04
2026.02
-31
2026.02
-39.2
2026.02
-32.9