Mathematical Reasoning on SVAMP (Pass@20)
[Chart: Pass@20 over time. Top result: Entropy-Tree, Pass@20 = 0.9733, Jan 2, 2026]
Evaluation Results
Method        Model                  Date     Pass@20
Entropy-Tree  Qwen2.5-14B-Inst...    2026.01  0.9733
Multi-chain   Qwen2.5-14B-Inst...    2026.01  0.97
Entropy-Tree  Qwen2.5-7B-Instruct    2026.01  0.96
Multi-chain   Qwen2.5-32B-Inst...    2026.01  0.96
Entropy-Tree  Qwen2.5-32B-Inst...    2026.01  0.96
Multi-chain   Qwen2.5-7B-Instruct    2026.01  0.9567