Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Tool Selection on EnterpriseBench
Loading...
24
Tool Selection Accuracy
Gemini-2.5 Pro
10.48
13.99
17.5
21.01
Mar 23, 2026
Tool Selection Accuracy
Updated 25d ago
Evaluation Results
Method
Method
Links
Tool Selection Accuracy
Gemini-2.5 Pro
Model Category=Closed-...
2026.03
24
Claude-3.5-Sonnet
Model Category=Closed-...
2026.03
22
GPT-4o
Model Category=Closed-...
2026.03
21
Qwen3-8B Agentic GRPO
Model Category=Our Pla...
2026.03
21
Qwen3-8B SFT
Model Category=Our Pla...
2026.03
17
Qwen3-8B Base
Model Category=Open-So...
2026.03
14
xLAM-2-70B
Model Category=Open-So...
2026.03
12
ToolAce
Model Category=Open-So...
2026.03
11
Feedback
Search any
task
Search any
task