Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Mathematical Reasoning on AMC 23 (Avg@16, #Token)
Loading...
82.1
Avg@16 Score
DLER-R1-1.5B
62.548
67.624
72.7
77.776
Mar 11, 2026
Avg@16 Score
Token Count
Updated 1mo ago
Evaluation Results
Method
Method
Links
Avg@16 Score
Token Count
DLER-R1-1.5B
Category=Length-orient...
2026.03
82.1
2,419
GRPO
Category=Performance-o...
2026.03
81.9
9,917
GR³
Category=Performance-o...
2026.03
81.6
4,153
Laser-DE-L4096-1.5B
Category=Length-orient...
2026.03
73.4
3,110
DeepSeek-R1-Distill-1.5B
Category=Initial model
2026.03
70.8
9,351
LCR1-1.5B
Category=Length-orient...
2026.03
67.8
4,170
AdaptThink-1.5B
Category=Length-orient...
2026.03
63.3
2,859
Feedback
Search any
task
Search any
task