Tool Use on ToolBench standard evaluation
Best Pass Rate: 51.3 (ToolRLA, Mar 2, 2026)
Updated 3mo ago
Evaluation Results
Method                          Date      Pass Rate
ToolRLA (Backbone=Qwen3-14B)    2026.03   51.3
Bloomberg AI Engineering        2026.03   48.2
GPT-4 (function calling)        2026.03   46.2
AvaTaR                          2026.03   44.3
ToolLLM                         2026.03   36.8
Gorilla                         2026.03   20.4