Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Mathematical Reasoning on GSM8K (In-Writing Metrics)
Loading...
73.8
NL-to-Format
Qwen3-1.7B
34.8
44.925
55.05
65.175
Jan 12, 2026
NL-to-Format
In-Writing-Base
In-Writing-IF
In-Writing-BF
Updated 4d ago
Evaluation Results
Method
Method
Links
NL-to-Format
In-Writing-Base
In-Writing-IF
In-Writing-BF
Qwen3-1.7B
Shot=0
2026.01
73.8
59
53.3
52.8
Qwen3-1.7B
Shot=4
2026.01
73.2
69.4
64.6
67.6
Qwen3-1.7B
Shot=1
2026.01
71
65.9
58.2
60.2
SmolLM2-1.7B
Shot=4
2026.01
39.8
39.1
34
38.8
SmolLM2-1.7B
Shot=1
2026.01
38.6
38.4
30.4
27.4
SmolLM2-1.7B
Shot=0
2026.01
36.3
36.6
41.1
40.1
Feedback
Search any
task
Search any
task