Mathematical Problem Solving on MATH (Acc@t1/t2 Evaluation)

47.4 Accuracy @ t1

Prompt based

[Chart: Accuracy @ t1 over time; y-axis ticks 6.008, 16.754, 27.5, 38.246; x-axis label May 22, 2025]
Updated 4d ago

Evaluation Results

Date      Acc @ t1   Acc @ t2   Diff (t2 - t1)
2025.05   47.4       -          -
2025.05   47.4       62.8       15.4
2025.05   47.4       62         14.6
2025.05   40.8       -          -
2025.05   40.8       49.6       8.8
2025.05   40.8       48.6       7.8
2025.05   14.4       -          -
2025.05   14.4       15         0.6
2025.05   14.4       23.6       9.2
2025.05   14.4       14.5       0.1
2025.05   14.4       15.2       0.8
2025.05   14.4       14.8       0.4
2025.05   10         -          -
2025.05   9.2        -          -
2025.05   9.2        10.6       1.4
2025.05   9.2        10.2       1
2025.05   9.2        10         0.8
2025.05   7.6        -          -
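
The three numbers in each complete result row appear to be accuracy at t1, accuracy at t2, and the difference between them in percentage points. The page does not state this, so it is an assumption. A minimal sketch that checks the reading against the listed values (the helper name `improvement` is hypothetical):

```python
# Rows that report all three values, copied from the results above:
# (accuracy @ t1, accuracy @ t2, listed third value)
rows = [
    (47.4, 62.8, 15.4),
    (47.4, 62.0, 14.6),
    (40.8, 49.6, 8.8),
    (40.8, 48.6, 7.8),
    (14.4, 15.0, 0.6),
    (14.4, 23.6, 9.2),
    (14.4, 14.5, 0.1),
    (14.4, 15.2, 0.8),
    (14.4, 14.8, 0.4),
    (9.2, 10.6, 1.4),
    (9.2, 10.2, 1.0),
    (9.2, 10.0, 0.8),
]


def improvement(acc_t1: float, acc_t2: float) -> float:
    """Percentage-point change from t1 to t2, rounded to one decimal.

    Assumption: the third value in each row is acc_t2 - acc_t1.
    """
    return round(acc_t2 - acc_t1, 1)


# Collect every row whose listed third value disagrees with the computed difference.
mismatches = [row for row in rows if improvement(row[0], row[1]) != row[2]]
print(mismatches)
```

An empty `mismatches` list means every complete row is consistent with that reading. Rows that report only accuracy at t1 are left out because they have nothing to compare.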