Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Tool Calling on BFCL Multiple v1 (test)
Loading...
92
Accuracy
Full-FT
43.64
56.195
68.75
81.305
Feb 12, 2026
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
Full-FT
Backbone=Qwen3-8B-Base...
2026.02
92
PrefillShare
Backbone=Qwen3-8B-Base...
2026.02
91
PrefillShare
Backbone=LLaMA3.1-8B,...
2026.02
88.5
Full-FT
Backbone=LLaMA3.1-8B,...
2026.02
88
Qwen3-8B-Base
KV Sharing=Inherent
2026.02
80
LLaMA3.1-8B
KV Sharing=Inherent
2026.02
45.5
Feedback
Search any
task
Search any
task