Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Problem Solving and Unsolvability Detection on AIME 24-25

95Solvable Accuracy

Gemini-3

33.01649.10865.281.292Dec 1, 2025
Updated 4d ago

Evaluation Results

MethodLinks
2025.12
9521.258.1
2025.12
8540.362.7
2025.12
69.640.455
2025.12
67.914.841.4
2025.12
61.742.552.1
2025.12
38.321.229.8
2025.12
35.428.732.1