Share your thoughts, 1 month free Claude Pro on usSee more

Mathematical Reasoning on AIME decontaminated 24 (Accuracy)

27.9Accuracy

DAPO-Math-17k

Updated 1mo ago

Evaluation Results

Method	Links
DAPO-Math-17k 2026.05		27.9
DAPO++ 2026.05		25.9
DeepScaleR 2026.05		23.4
Skywork-OR1-RL-Data 2026.05		17.3
DeepMath-103K 2026.05		17.1
OpenR1-Math-220k 2026.05		16.5
Qwen3-8B-Base 2026.05		10.4
DAPO++ 2026.05		9.4
DeepMath-103K 2026.05		8.4
DAPO-Math-17k 2026.05		7
OpenR1-Math-220k 2026.05		6.9
DeepScaleR 2026.05		6.3
Skywork-OR1-RL-Data 2026.05		5.8
Qwen3-1.7B-Base 2026.05		4.7