Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

KMMLU

Benchmarks

Task NameDataset NameSOTA ResultTrend
KnowledgeKMMLU-Pro
Score70.9
7
Text-to-TextKMMLU Korean
Score75.2
4
General Language UnderstandingKMMLU-Pro
Score64
4
General Language UnderstandingKMMLU
Overall Score73
4
KnowledgeKMMLU Redux
Score75.9
3
KnowledgeKMMLU
Score0.787
3
Showing 6 of 6 rows