Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Coding on CODE (test)
Loading...
3.62
Turns
PROMPTED
3.4536
3.4968
3.54
3.5832
Feb 18, 2026
Turns
U
C
Accuracy
Reward
Updated 4d ago
Evaluation Results
Method
Method
Links
Turns
U
C
Accuracy
Reward
PROMPTED
Training Status=Withou...
2026.02
3.62
2.67
1.42
95.8
0.229
RL
Training Status=With R...
2026.02
3.51
2.13
1.39
99.7
0.259
CTA-PROMPTED
Training Status=Withou...
2026.02
3.47
2.51
1.41
94.5
0.24
CTA-RL
Training Status=With R...
2026.02
3.46
1.98
1.46
99.1
0.268
Feedback
Search any
task
Search any
task