Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Function Calling on BFCL Extended Setting (Non-Live)
Loading...
74.92
Simple Success Rate
GT_Funs
15.64
31.03
46.42
61.81
Mar 12, 2026
Simple Success Rate
Multiple Success Rate
Parallel Success Rate
Parallel Multiple Success Rate
Updated 1mo ago
Evaluation Results
Method
Method
Links
Simple Success Rate
Multiple Success Rate
Parallel Success Rate
Parallel Multiple Success Rate
GT_Funs
Model=Qwen2.5-7B-Instr...
2026.03
74.92
95
90.5
85.5
Tool-DC (TF)
Model=Qwen2.5-7B-Instr...
2026.03
73.17
89
88.5
79.5
All_Funs
Model=Qwen2.5-7B-Instr...
2026.03
68.67
85.5
84
71
GT_Funs
Model=InternLM2.5-7B-C...
2026.03
51.92
85
70.5
51
Tool-DC (TF)
Model=InternLM2.5-7B-C...
2026.03
41
65
46
28.5
All_Funs
Model=InternLM2.5-7B-C...
2026.03
17.92
26
25
10.5
Feedback
Search any
task
Search any
task