Tool Calling on ToolCall-15
[Chart: Accuracy; best result 63 (Nemotron-Super-120B), May 16, 2026. Updated 15d ago.]
Evaluation Results
| Method | Attributes | Date | Accuracy |
|---|---|---|---|
| Nemotron-Super-120B | Deployment=Local model | 2026.05 | 63 |
| Best Local | Config=Oracle local ro... | 2026.05 | 63 |
| Qwen3.5-35B | Deployment=Local model | 2026.05 | 60 |
| Qwen3.5-122B | Deployment=Local model | 2026.05 | 60 |
| Claude Opus 4.6 | Deployment=Cloud baseline | 2026.05 | 53.3 |
| Gemini 3.1 Pro | Deployment=Cloud baseline | 2026.05 | 53.3 |
| Qwen3.5-9B | Deployment=Local model | 2026.05 | 53.3 |
| Qwen3.5-27B | Deployment=Local model | 2026.05 | 53.3 |
| Granite 3.3 8B | Deployment=Local model | 2026.05 | 49 |
| GPT 5.4 | Deployment=Cloud baseline | 2026.05 | 46.6 |
| Granite 4.0 H-Small | Deployment=Local model | 2026.05 | 42 |
| Gemma4-E4B | Deployment=Local model | 2026.05 | 28 |
| Gemma4-26B | Deployment=Local model | 2026.05 | 28 |