Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

General Reasoning on BBEH

78.8Accuracy

Gemini 3-Pro

5.27224.36143.4562.539Dec 2, 2025Dec 12, 2025Dec 23, 2025Jan 3, 2026Jan 13, 2026Jan 24, 2026Feb 4, 2026
Updated 3d ago

Evaluation Results

MethodLinks
78.8-
69-
68.8-
67.04-
2026.02
66.63-
2025.12
14.9-
2025.12
12.3-
2025.12
12.3-
2025.12
12.2-
2025.12
11.8-
2025.12
11.8-
2025.12
11.3-
2025.12
11.2-
2025.12
10.8-
2025.12
10.8-
2025.12
10.5-
2025.12
10.4-
2025.12
8.3-
2025.12
8.1-
2026.01
-44.5
2026.01
-33.8
2026.01
-9.9
2026.01
-12.9
2026.01
-13.1
2026.01
-22.1
2026.01
-15.6
2026.01
-18
2026.01
-27.4
2026.01
-34.1