Language Understanding on MMLU 1-order
[Chart: Accuracy over time on MMLU 1-order; current best 55.2 (Raw), as of Mar 17, 2026. Updated 1 month ago.]
Evaluation Results

Method       Model          Date     Accuracy
Raw          Llama-3.1-8B   2026.03  55.2
MOSAIC       Llama-3.1-8B   2026.03  55.1
ORPO         Llama-3.1-8B   2026.03  54.9
SFT          Llama-3.1-8B   2026.03  54.7
In-context   Llama-3.1-8B   2026.03  53.4
Raw          Llama-3.2-3B   2026.03  50.7
MOSAIC       Llama-3.2-3B   2026.03  49.8
SFT          Llama-3.2-3B   2026.03  49.2
In-context   Llama-3.2-3B   2026.03  48.9
ORPO         Llama-3.2-3B   2026.03  48.9