
Mathematical Reasoning on OlympiadBench Math (test)

35.9 Accuracy

TATA

[Chart: accuracy over time, Jun 18, 2024 to Feb 17, 2025]
Updated 4d ago

Evaluation Results

Date      Accuracy
2025.02   35.9
2025.02   35.3
2025.02   35.3
2025.02   34.4
2025.02   32.3
2025.02   31.7
2025.02   31.1
2025.02   30.8
2025.02   30.2
2025.02   27.4
2025.02   26.8
2025.02   26.2
2025.02   24.9
2025.02   24.6
2025.02   24.4
2025.02   23.3
2025.02   23.1
2025.02   23
2024.06   21.7
2025.02   21.5
2024.06   21.3
2025.02   21.3
2025.02   20.6
2024.06   20
2025.02   19.7
2024.06   19.3
2024.06   19.1
2024.06   19.1
2025.02   18.8
2025.02   18.8
2025.02   17.3
2024.06   16.3
2024.06   15.3
2025.02   15.3
2025.02   14.8
2024.06   14.7
2024.06   14.5
2024.06   14.2
2025.02   13.9
2024.06   13.6
2024.06   13.2
2024.06   13
2024.06   11.6
2024.06   10.8
2024.06   10.5
2024.06   9.6
2024.06   9.4
2024.06   9.3
2024.06   9.3
2024.06   8.7
2025.02   8.6
2025.02   7.9
2024.06   7.7
2025.02   7.7
2025.02   7.1
2024.06   5.9
2024.06   5.5
2024.06   4.2
2024.06   3.7