Agentic Tool Use on Tool-Decathlon
Top score: 46.3 (GPT-5.2 (xhigh))
[Chart: Score, Feb 17, 2026]
Updated 4d ago
Evaluation Results
Method             Date      Score
GPT-5.2 (xhigh)    2026.02   46.3
Claude Opus 4.5    2026.02   43.5
GLM-5              2026.02   39.2
Gemini 3 Pro       2026.02   36.4
DeepSeek-V3.2      2026.02   35.2
Kimi K2.5          2026.02   27.8
GLM-4.7            2026.02   23.8