
Mathematical Reasoning on MMLU Mathematics (test)

50 Average Accuracy

DR+DA

[Chart: Average Accuracy by date, Nov 16, 2022 to Jan 29, 2026]
Updated 4d ago

Evaluation Results

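Method | Date | Average Accuracy | | | | | | | Links
 | 2026.01 | 50 | - | - | - | - | - | 4 |
 | 2026.01 | 45.3 | - | - | - | - | - | 1.9 |
 | 2026.01 | 44.5 | - | - | - | - | - | 4.8 |
 | 2022.11 | 41.3 | 27 | 54.2 | 37 | 44 | 40.5 | - |
 | 2026.01 | 38.5 | - | - | - | - | - | 3.4 |
 | 2026.01 | 37.2 | - | - | - | - | - | 1 |
 | 2022.11 | 37.1 | 33 | 41.5 | 33.3 | 39 | 37.3 | - |
 | 2022.11 | 35.8 | 33 | 38.1 | 32.6 | 43 | 32.5 | - |
 | 2022.11 | 35.7 | 31 | 41.5 | 31.9 | 32 | 33.3 | - |
 | 2022.11 | 30.6 | 25 | 33.6 | 23.7 | 37 | 35.7 | - |
 | 2022.11 | 29.9 | 30 | 30.2 | 26.3 | 36 | 31.7 | - |
 | 2022.11 | 29.2 | 28 | 28.9 | 26.7 | 36 | 31 | - |
 | 2022.11 | 28 | 33.3 | 30.7 | 25.2 | 26 | 33.3 | - |
 | 2022.11 | 27.1 | 28 | 27.2 | 26.7 | 30 | 24.6 | - |
 | 2022.11 | 26.7 | 21 | 25.7 | 24.4 | 33 | 29.4 | - |
 | 2026.01 | 26.6 | - | - | - | - | - | 1 |
 | 2022.11 | 26.4 | 25 | 26.7 | 27 | 25 | 26.2 | - |
 | 2022.11 | 24.6 | 22 | 24.6 | 18.9 | 25 | 31 | - |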