Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

General Knowledge Reasoning on MMLU-CF

75.9Accuracy

GHG-TDA

71.94872.9747475.026Feb 10, 2026
Updated 4d ago

Evaluation Results

MethodLinks
2026.02
75.9
2026.02
75.2
2026.02
75
2026.02
74.4
2026.02
74.3
2026.02
74.1
2026.02
73.2
2026.02
73
2026.02
73
2026.02
72.1