
Mathematical Reasoning on OlympiadBench (Avg@3)

36 Avg@3

ADORA

Updated 4mo ago

Evaluation Results

Method	Date	Avg@3
ADORA	2026.02	36
GRPO	2026.02	35.1
Qwen2.5-7B	2026.02	26.3
ADORA	2026.02	12.9
GRPO	2026.02	12
ADORA	2026.02	10.5
GRPO	2026.02	5.3
ADORA	2026.02	4.7
GRPO	2026.02	4.1
Llama-3.1-8B	2026.02	3.1
DeepSeek-Math-7B	2026.02	3
Mistral-v0.1-7B	2026.02	2.4