Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Tool Use on BFCL Live Parallel
Loading...
56.25
BFCL Official Success Rate
Qwen2.5-7B
-2.25
12.9375
28.125
43.3125
Jun 1, 2026
BFCL Official Success Rate
Updated 23h ago
Evaluation Results
Method
Method
Links
BFCL Official Success Rate
Qwen2.5-7B
Backbone=Qwen2.5-7B-Base
2026.06
56.25
SimpleRL
Backbone=Qwen2.5-7B-Base
2026.06
56.25
Reasoner
Backbone=Qwen2.5-7B-Base
2026.06
56.25
ISO-C
Backbone=Qwen2.5-7B-Base
2026.06
56.25
ISO-CTS
Backbone=Qwen2.5-7B-Base
2026.06
56.25
RESMERGE
Backbone=Qwen2.5-7B-Ba...
2026.06
56.25
TA
Backbone=Qwen2.5-7B-Base
2026.06
50
TSV-Merge
Backbone=Qwen2.5-7B-Base
2026.06
50
Zero
Backbone=Qwen2.5-7B-Base
2026.06
6.25
TIES
Backbone=Qwen2.5-7B-Base
2026.06
0
DARE + TIES
Backbone=Qwen2.5-7B-Base
2026.06
0
RAM
Backbone=Qwen2.5-7B-Base
2026.06
0
Feedback
Search any
task
Search any
task