Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Code Generation on 133 PowerShell CodeGen prompts
Loading...
46
FRate (%)
o3-mini
0.24
12.12
24
35.88
Jan 10, 2026
FRate (%)
SRate (%)
Updated 4d ago
Evaluation Results
Method
Method
Links
FRate (%)
SRate (%)
o3-mini
Decoding=Default (no t...
2026.01
46
30
GPT-4o
Decoding=Deterministic...
2026.01
42
34
Qwen2.5-7B
Decoding=Deterministic...
2026.01
24
34
Qwen2.5-Coder-7B
Decoding=Deterministic...
2026.01
20
22
DeepSeek-R1-Distill-Qwen-7B
Decoding=Deterministic...
2026.01
2
0
Feedback
Search any
task
Search any
task