Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Tool Calling on ToolBench Generalization (I1-Tool)

57.7SoPR

ToolLlama*

27.311235.200643.0950.9794Jan 29, 2026
Updated 4d ago

Evaluation Results

MethodLinks
2026.01
57.748.73
2026.01
57.5946.2
2026.01
56.5440.51
2026.01
54.8536.08
2026.01
53.1649.37
2026.01
45.3632.91
2026.01
28.4826.58