Language Understanding on MMLU 3-order
[Chart: Accuracy over time; current top score 55.2 Accuracy (Raw), as of Mar 17, 2026. Updated 1 month ago.]
Evaluation Results

| Method | Model | Date | Accuracy |
| --- | --- | --- | --- |
| Raw | Llama-3.1-8B | 2026.03 | 55.2 |
| MOSAIC | Llama-3.1-8B | 2026.03 | 55.1 |
| ORPO | Llama-3.1-8B | 2026.03 | 54.9 |
| SFT | Llama-3.1-8B | 2026.03 | 54.7 |
| In-context | Llama-3.1-8B | 2026.03 | 52.7 |
| Raw | Llama-3.2-3B | 2026.03 | 50.7 |
| MOSAIC | Llama-3.2-3B | 2026.03 | 49.4 |
| ORPO | Llama-3.2-3B | 2026.03 | 49.1 |
| SFT | Llama-3.2-3B | 2026.03 | 49.0 |
| In-context | Llama-3.2-3B | 2026.03 | 47.2 |
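The evaluation results above can be held as plain data for quick per-model comparison. A minimal sketch, with the accuracy figures taken from the table; the variable and function names are illustrative, not part of any benchmark API:

```python
# MMLU accuracy by model and method, copied from the results table above.
RESULTS = {
    "Llama-3.1-8B": {"Raw": 55.2, "MOSAIC": 55.1, "ORPO": 54.9,
                     "SFT": 54.7, "In-context": 52.7},
    "Llama-3.2-3B": {"Raw": 50.7, "MOSAIC": 49.4, "ORPO": 49.1,
                     "SFT": 49.0, "In-context": 47.2},
}

def rank_methods(model: str) -> list[tuple[str, float]]:
    """Return (method, accuracy) pairs for a model, highest accuracy first."""
    return sorted(RESULTS[model].items(), key=lambda kv: kv[1], reverse=True)

for model in RESULTS:
    best_method, best_acc = rank_methods(model)[0]
    print(f"{model}: best = {best_method} ({best_acc})")
```

For both model sizes the ranking is the same: Raw scores highest, followed by MOSAIC, ORPO, SFT, and In-context.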