Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Tool Calling on BFCL V3
Loading...
70.4
pass@1
Qwen3 14B
64.16
65.78
67.4
69.02
Dec 15, 2025
pass@1
Updated 4d ago
Evaluation Results
Method
Method
Links
pass@1
Qwen3 14B
Parameters=14B
2025.12
70.4
Gemini-2.5 Flash-Thinking
Thinking Mode=true
2025.12
68.6
Qwen3 8B
Parameters=8B
2025.12
68.1
DeepSeek-R1 0528 671B
Parameters=671B, Think...
2025.12
67.9
Nemotron-Cascade 14B-Thinking
Parameters=14B, Thinki...
2025.12
67.5
Nemotron-Nano 9B-v2
Parameters=9B-v2
2025.12
66.9
Nemotron Cascade-8B
Parameters=8B, Thinkin...
2025.12
64.4
Feedback
Search any
task
Search any
task