Factual Knowledge on MMLU-Pro

58.4 EM

GPT-4o-mini

Updated 5mo ago

Evaluation Results

Method	Links	EM
GPT-4o-mini 2026.01		58.4	0.61
PIR 2026.01		52.87	1.32
Reasoning Base 2026.01		51.21	2.04
PIR 2026.01		50.29	1.33
PIR 2026.01		50	1.31