Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

General Reasoning on HLE

38.4Accuracy

Gemini-3.0

2.800812.042921.28530.5271Feb 2, 2026Feb 3, 2026Feb 5, 2026Feb 7, 2026Feb 8, 2026Feb 10, 2026Feb 12, 2026
Updated 3d ago

Evaluation Results

MethodLinks
2026.02
38.413,000
37.5-
2026.02
31.524,000
2026.02
25.81-
25.1-
2026.02
25.121,000
24.8-
2026.02
23.929,000
21.6-
2026.02
18.86-
2026.02
18.16-
2026.02
17.52-
2026.02
16.4-
2026.02
10.57-
2026.02
10.47-
2026.02
10.06-
2026.02
9.68-
2026.02
5.51-
2026.02
5.33-
2026.02
4.4-
2026.02
4.17-