Math on AIME-Extend
Top result: AdaRAS, 52.67 accuracy
[Chart: Accuracy, Jan 27, 2026]
Updated 4d ago
Evaluation Results
| Method | Details | Date | Accuracy |
|---|---|---|---|
| AdaRAS | Category=Steering | 2026.01 | 52.67 |
| Probing | Category=Steering | 2026.01 | 48 |
| CoT | Prompting=Vanilla CoT,... | 2026.01 | 47.33 |
| OpenReasoning-Nemotron-1.5B | Category=Post-training | 2026.01 | 47.33 |
| OpenThinker-3-1.5B | Category=Post-training | 2026.01 | 42.67 |
| DeepSeek-R1-Distill-Qwen-1.5B | Category=Post-training | 2026.01 | 21.33 |