Decision Inference on MMLU

Accuracy: 0.772

Ours

[Chart: accuracy by date; y-axis ticks 0.68568, 0.70809, 0.7305, 0.75291; x-axis label Feb 18, 2025]
Updated 4d ago

Evaluation Results

Date      Accuracy
2025.02   0.772
2025.02   0.753
2025.02   0.746
2025.02   0.743
2025.02   0.737
2025.02   0.723
2025.02   0.721
2025.02   0.707
2025.02   0.705
2025.02   0.703
2025.02   0.689