Tool Use on StableToolBench G1 Instruction
[Chart: SL Score and QL Score by date. Best SL Score: 75.5 (Trace-Based), Feb 23, 2026.]
Evaluation Results
| Method | Evaluation Protocol | Date | SL Score | QL Score |
|---|---|---|---|---|
| Trace-Based | Tr... | 2026.02 | 75.5 | 66.9 |
| D1 | Tr... | 2026.02 | 74.7 | 66.1 |
| D2 | Tr... | 2026.02 | 73.4 | 62.5 |
| D0 | Tr... | 2026.02 | 72.8 | 62.3 |
| Play2Prompt | Tr... | 2026.02 | 72 | 62.5 |
| DRAFT | Tr... | 2026.02 | 70 | 58 |