Mathematical Reasoning on AMC23 (Avg@3)
[Chart: Avg@3 Score over time. Top entry: ADORA, 62.5 (Feb 10, 2026). Updated 4d ago.]
Evaluation Results
| Method | Configuration (truncated) | Date | Avg@3 Score |
|---|---|---|---|
| ADORA | Backbone=Qwen2.5-7B, T... | 2026.02 | 62.5 |
| GRPO | Backbone=Qwen2.5-7B, S... | 2026.02 | 50 |
| Qwen2.5-7B | Training Method=Base,... | 2026.02 | 37.5 |
| ADORA | Backbone=DeepSeek-Math... | 2026.02 | 25 |
| GRPO | Backbone=DeepSeek-Math... | 2026.02 | 20 |
| GRPO | Backbone=Llama-3.1-8B,... | 2026.02 | 15 |
| ADORA | Backbone=Llama-3.1-8B,... | 2026.02 | 15 |
| DeepSeek-Math-7B | Training Method=Base,... | 2026.02 | 10 |
| GRPO | Backbone=Mistral-v0.1-... | 2026.02 | 10 |
| ADORA | Backbone=Mistral-v0.1-... | 2026.02 | 10 |
| Llama-3.1-8B | Training Method=Base,... | 2026.02 | 2.5 |
| Mistral-v0.1-7B | Training Method=Base,... | 2026.02 | 0 |
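The page does not define the Avg@3 Score. A common reading, assumed here, is the mean accuracy (in percent) over three independent sampling runs on the benchmark. The sketch below illustrates that reading only; the function name `avg_at_k` and the sample data are hypothetical and are not taken from the leaderboard or from any method listed on it.

```python
from statistics import mean


def avg_at_k(correct_per_run: list[list[bool]]) -> float:
    """Mean accuracy in percent across k independent runs.

    correct_per_run[i][j] is True if run i solved problem j.
    Assumption: Avg@k is the average of the per-run accuracies.
    """
    if not correct_per_run:
        raise ValueError("need at least one run")
    per_run_accuracy = [100.0 * sum(run) / len(run) for run in correct_per_run]
    return mean(per_run_accuracy)


# Hypothetical data: 3 runs over 4 problems (not real leaderboard results).
runs = [
    [True, True, False, False],   # run 1: 2 of 4 solved
    [True, False, False, False],  # run 2: 1 of 4 solved
    [True, True, True, False],    # run 3: 3 of 4 solved
]
score = avg_at_k(runs)
print(f"Avg@{len(runs)} = {score:.1f}")
```

Averaging over several runs reduces the sampling noise of a single evaluation pass, which matters on a small benchmark where one problem shifts the score by a few points.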