Massive Multitask Language Understanding on MMLU (Sub-category Performance)

STEM Accuracy: 82.7

LeanQuant

Updated 2mo ago

Evaluation Results

Method	STEM Accuracy
LeanQuant 2026.05	82.7	83.2	90.6	87.7	86.1
OSAQ+GPTQ 2026.05	82.6	83.2	90.8	87.7	86.1
GPTQ 2026.05	82.3	82.6	90.5	87.5	85.7
OSAQ+GPTQ 2026.05	76.7	77.4	89.3	85.7	82.3
LeanQuant 2026.05	76.6	77.3	89.2	85.9	82.3
GPTQ 2026.05	76.3	77.2	89.3	85.2	82