Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Function Calling on BFCL Non-Live v3
Loading...
94
Overall Accuracy
Qwen2.5-14B
77.672
81.911
86.15
90.389
Feb 4, 2026
Overall Accuracy
Function Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
Overall Accuracy
Function Accuracy
Qwen2.5-14B
Speedup ratio=1×
2026.02
94
99.3
Qwen3-4B
Speedup ratio=1×
2026.02
93.7
99.3
ST-Qwen2.5-7B
Speedup ratio=4.5×
2026.02
93.5
99.8
Qwen2.5-7B
Speedup ratio=1×
2026.02
93
99.7
Qwen2.5-3B
Speedup ratio=1×
2026.02
92.5
99.7
ST-Qwen3-4B
Speedup ratio=3.9×
2026.02
92.5
99.5
ST-Qwen2.5-14B
Speedup ratio=4.4×
2026.02
92.2
99.7
ST-Qwen2.5-3B
Speedup ratio=3.8×
2026.02
90.3
99.8
ST-Qwen2.5-1.5B
Speedup ratio=3.5×
2026.02
88.8
99.8
Qwen2.5-1.5B
Speedup ratio=1×
2026.02
86.3
98.3
ST-Qwen2.5-0.5B
Speedup ratio=3.1×
2026.02
79.8
99.8
Qwen2.5-0.5B
Speedup ratio=1×
2026.02
78.3
98.3
Feedback
Search any
task
Search any
task