Mathematical Reasoning on AMC23 (Average Score)
[Chart: Average Score. Highlighted result: Qwen2.5-7B-Instruct, 91 (Jan 30, 2026)]
Updated 4d ago
Evaluation Results
Method               Details                    Date     Average Score
Qwen2.5-7B-Instruct  Framework=Multi-Dimens...  2026.01  91
R1-Searcher          Category=RL-only TIR M...  2026.01  89
AutoTraj             Category=SFT-RL TIR Me...  2026.01  89
Tool-Star-SFT        Category=SFT-only TIR...   2026.01  88
AutoTIR              Category=RL-only TIR M...  2026.01  87
Tool-Star            Category=SFT-RL TIR Me...  2026.01  82
Vanilla SFT-RL TIR   Category=SFT-RL TIR Me...  2026.01  80
ReSearch             Category=RL-only TIR M...  2026.01  76
ToRL                 Category=RL-only TIR M...  2026.01  75