
Mathematical Problem Solving on AIME VeRA-H Pro 2024-II

78.6 Avg@5 Accuracy

GPT-5-high

[Chart: Avg@5 accuracy; axis ticks 26.6, 40.1, 53.6, 67.1; Jan 23, 2026]
Updated 4d ago
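
The page does not define the headline metric. As a minimal sketch, assuming Avg@5 means the percentage of correct answers averaged over 5 independent attempts per problem, it could be computed as below. The function name `avg_at_k` and the input layout are hypothetical, not taken from the benchmark's own code.

```python
def avg_at_k(results):
    """Average accuracy (in percent) over k attempts per problem.

    Assumption: `results` holds one list per problem, each containing
    k booleans, one per sampled attempt (True means a correct answer).
    """
    if not results:
        raise ValueError("results must contain at least one problem")
    k = len(results[0])
    if k == 0 or any(len(attempts) != k for attempts in results):
        raise ValueError("every problem needs the same non-zero number of attempts")
    # Count correct attempts across all problems, then divide by the
    # total number of attempts (problems times k).
    correct = sum(sum(bool(a) for a in attempts) for attempts in results)
    return 100.0 * correct / (k * len(results))


# Two problems, five attempts each: 5/5 and 3/5 correct, so 8 of 10 overall.
example = [
    [True, True, True, True, True],
    [True, True, True, False, False],
]
score = avg_at_k(example)
```

With equal attempt counts per problem, this pooled ratio equals the mean of the five per-run accuracies.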

Evaluation Results

Method  Links
2026.01
78.6  -10
74.3  -20
68.6  -22.9
2026.01
67.1  -27.1
67.1  -20
67.1  -25.7
65.7  -27.1
62.9  -12.9
60  -30
58.6  -31.4
57.1  -31.4
2026.01
51.4  -31.4
51.4  -28.6
45.7  -40
32.9  -38.6
28.6  -24.3