Mathematical Reasoning on MATH500 (Accuracy)

84.6 Accuracy

INSTRUCT (BASE)

Updated 4mo ago

Evaluation Results

Method	Links	Accuracy
INSTRUCT (BASE) 2026.02		84.6
DPO-R1 (ZHANG ET AL., 2025) 2026.02		84.4
DPO-R1 (HIGH) 2026.02		84.4
PACE 2026.02		84.4
DPO-R1 (LOW) 2026.02		84.2
DPO-R1 (MIDDLE) 2026.02		84.2
DPO-R1 (LOW) 2026.02		83.2
DPO-R1 (ZHANG ET AL., 2025) 2026.02		83.2
DPO-R1 (HIGH) 2026.02		83.2
DPO-R1 (MIDDLE) 2026.02		82.4
PACE 2026.02		82.2
INSTRUCT (BASE) 2026.02		81.2
DPO-R1 (LOW) 2026.02		37.8
PACE 2026.02		37.0
DPO-R1 (ZHANG ET AL., 2025) 2026.02		35.8
DPO-R1 (HIGH) 2026.02		35.6
DPO-R1 (MIDDLE) 2026.02		34.8
INSTRUCT (BASE) 2026.02		29.0