Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Tool Learning on StableToolBench Average

70.3SoPR

GPT-4 (DFSDT)

23.535.6547.859.95Jan 21, 2025
Updated 4d ago

Evaluation Results

MethodLinks
2025.01
70.364.2
2025.01
69.270.7
2025.01
66.765.5
2025.01
66.159.1
2025.01
6355.3
2025.01
61.953
2025.01
54.247.1
2025.01
48.258.7
2025.01
47.9-
2025.01
39.237.6
2025.01
37.939.3
2025.01
36.237.9
2025.01
25.327.3