Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Function Calling on BFCL Exec v3
Loading...
94.6
Overall Accuracy
Qwen3-4B
86.28
88.44
90.6
92.76
Feb 4, 2026
Overall Accuracy
Function Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
Overall Accuracy
Function Accuracy
Qwen3-4B
Speedup ratio=1×
2026.02
94.6
97.3
Qwen2.5-7B
Speedup ratio=1×
2026.02
94
99.3
Qwen2.5-14B
Speedup ratio=1×
2026.02
94
98.7
ST-Qwen2.5-7B
Speedup ratio=4.5×
2026.02
92.6
100
Qwen2.5-1.5B
Speedup ratio=1×
2026.02
91.9
98
ST-Qwen2.5-3B
Speedup ratio=3.8×
2026.02
91.3
100
ST-Qwen2.5-14B
Speedup ratio=4.4×
2026.02
91.3
100
ST-Qwen2.5-1.5B
Speedup ratio=3.5×
2026.02
90.6
100
ST-Qwen3-4B
Speedup ratio=3.9×
2026.02
89.9
100
Qwen2.5-0.5B
Speedup ratio=1×
2026.02
88.6
96.6
ST-Qwen2.5-0.5B
Speedup ratio=3.1×
2026.02
87.9
100
Qwen2.5-3B
Speedup ratio=1×
2026.02
86.6
96
Feedback
Search any
task
Search any
task