Tool Calling on API-Bank Call
Top result: Qwen2.5-7B, Task Completion Rate 34.7 (Oct 8, 2025)
Evaluation Results
Method        Configuration                Date      Task Completion Rate
Qwen2.5-7B    Schema configuration=P...    2025.10   34.7
Llama3.1-8B   Schema configuration=P...    2025.10   29.8
Qwen2.5-3B    Schema configuration=P...    2025.10   28.5
Llama3.1-8B   Schema configuration=Base    2025.10   28
Qwen2.5-7B    Schema configuration=Base    2025.10   25.7
Qwen2.5-3B    Schema configuration=Base    2025.10   18
Llama3.2-3B   Schema configuration=P...    2025.10   5.4
Llama3.2-3B   Schema configuration=Base    2025.10   5.1