Search on Frames
Top score: 70.5 (Qwen3-235B)
[Chart: Score by date; data point at Jan 30, 2026]
Updated 4d ago
Evaluation Results
Method          Settings                   Date     Score
Qwen3-235B      Reasoning protocol=non...  2026.01  70.5
SYNTHAGENT-14B  Reasoning protocol=non...  2026.01  63.5
ToolStar-14B    Reasoning protocol=non...  2026.01  60.4
SYNTHAGENT-8B   Reasoning protocol=non...  2026.01  59.7
ToolStar-8B     Reasoning protocol=non...  2026.01  58.5
Qwen3-32B       Reasoning protocol=non...  2026.01  44.8
Qwen3-14B       Reasoning protocol=non...  2026.01  37.8