Multi-domain Knowledge and Reasoning on MMLU-Pro

Accuracy: 77.7

Qwen3

Updated 3mo ago

Evaluation Results

Method	Date	Accuracy
Qwen3	2026.04	77.7	2,400
Apriel-Reasoner (Ours)	2026.04	77.3	1,900
Phi-4-reasoning	2026.04	77.1	3,400
Nemotron-Cascade	2026.04	76.8	3,600
Apriel-Base	2026.04	76.4	3,500
Apriel-Base + RLVR w/ LP	2026.04	75.6	1,500
COACT	2026.04	24.17	-
Pref + Ent	2026.04	23.45	-
Entropy	2026.04	22.91	-
Pref Certainty	2026.04	22.18	-
Random	2026.04	21.54	-