
Language Understanding on MMLU-ProX (T1)

26.4 Accuracy

Naive Fine-tuning

Apr 22, 2026
Updated 1mo ago

Evaluation Results

Method    Links
2026.04    26.4
2026.04    26.1
2026.04    25.9
2026.04    25.8
2026.04    25.7
2026.04    0.408
2026.04    0.406
2026.04    0.405
2026.04    0.402
2026.04    0.402