Share your thoughts, 1 month free Claude Pro on usSee more

Mathematical Reasoning on HLE decontaminated (Accuracy)

8.4Accuracy

DeepMath-103K

Updated 1mo ago

Evaluation Results

Method	Links
DeepMath-103K 2026.05		8.4
DAPO++ 2026.05		6.8
DeepScaleR 2026.05		6.3
Qwen3-1.7B-Base 2026.05		5.9
DAPO-Math-17k 2026.05		5.9
Qwen3-8B-Base 2026.05		5.7
Skywork-OR1-RL-Data 2026.05		5.1
OpenR1-Math-220k 2026.05		4.7
DAPO++ 2026.05		4.7
DeepScaleR 2026.05		4.5
DeepMath-103K 2026.05		4.5
DAPO-Math-17k 2026.05		4.1
OpenR1-Math-220k 2026.05		3.9
Skywork-OR1-RL-Data 2026.05		3.1