
Language Understanding on MMLU (Accuracy, Throughput, and Latency)

84.9 Accuracy

Verify-Only

[Chart: accuracy by date, Dec 2, 2025 to Dec 12, 2025]
Updated 4d ago

Evaluation Results

Method    Links
2025.12    84.9     10.952    0.408    1.042
2025.12    84.5     4.69      -        0.446
2025.12    84.1     10.735    -        1.021
2025.12    83.8     8.746     -        1
2025.12    83.7     10.511    -        1
2025.12    83.7     11.436    -        1.088
2025.12    83.6     4.376     -        0.553
2025.12    83.6     4.376     -        0.5
2025.12    83.5     7.912     -        1
2025.12    83.5     7.97      -        1.007
2025.12    83.5     9.085     -        1.039
2025.12    83.5     11.395    0.407    1.084
2025.12    83.3     8.481     0.418    1.072
2025.12    83.3     8.92      -        1.02
2025.12    83.1     8.023     -        1.014
2025.12    82.2     8.623     0.419    1.09
2025.12    81.6     9.385     0.465    1.073
2025.12    81.3     9.937     0.463    1.136
2025.12    80.81    -         -        1.56
2025.12    80.81    -         -        1.56
2025.12    80.77    -         -        1.62
2025.12    80.76    -         -        1.43
2025.12    80.75    -         -        1.43
2025.12    80.69    -         -        1
2025.12    80.69    -         -        1.54
2025.12    79.31    -         -        5
2025.12    79.31    -         -        2.5
2025.12    78.73    -         -        1.14
2025.12    72.538.544    -    3.667
2025.12    69.533.274    -    4.206
2025.12    45.395.141    -    10.878