
Mathematical Reasoning on OlympiadBench (pass@1, pass@5)

Top Pass@1: 0.1132 (Base Model)

[Chart: Pass@1 (y-axis) by date (x-axis); labeled date: Oct 4, 2025]
Updated 3d ago

Evaluation Results

Date      Pass@1   Pass@5
2025.10   0.1132   0.1956
2025.10   0.1121   0.2021
2025.10   0.1076   0.2047
2025.10   0.0919   0.1836
2025.10   0.0642   0.1083
2025.10   0.0582   0.1062
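
For readers unfamiliar with the metric, the pass@1 and pass@5 columns can be read as the probability that at least one of k sampled answers is correct. The sketch below uses the widely used unbiased estimator 1 - C(n-c, k) / C(n, k), computed per problem from n samples of which c are correct. This is an assumption for illustration: the page does not state how these scores were computed, and the function name `pass_at_k` is hypothetical.

```python
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Estimate pass@k for one problem from n samples, c of them correct.

    Assumed estimator (not confirmed by the leaderboard):
    1 - C(n - c, k) / C(n, k), i.e. one minus the chance that a random
    draw of k samples out of n contains no correct answer.
    """
    if not 0 <= c <= n:
        raise ValueError("c must satisfy 0 <= c <= n")
    if not 1 <= k <= n:
        raise ValueError("k must satisfy 1 <= k <= n")
    # math.comb returns 0 when k > n - c, so the estimate becomes 1.0
    # whenever fewer than k samples are incorrect.
    return 1.0 - comb(n - c, k) / comb(n, k)


def mean_pass_at_k(results: list[tuple[int, int]], k: int) -> float:
    """Average the per-problem estimate over (n, c) pairs."""
    return sum(pass_at_k(n, c, k) for n, c in results) / len(results)
```

A benchmark-level score would then be `mean_pass_at_k(results, 1)` or `mean_pass_at_k(results, 5)`, where `results` holds one `(n, c)` pair per problem.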