
Tool Learning on StableToolBench I1-Cat.

70.9 SoPR

GPT-4 (Parallel)

[Chart: score over time; y-axis ticks 30.132, 40.716, 51.3, 61.884; x-axis date Jan 21, 2025]
Updated 4d ago

Evaluation Results

Date      SoPR    Second score
2025.01   70.9    62.7
2025.01   69      54.2
2025.01   68.1    61.4
2025.01   67.2    54.2
2025.01   65.8    60.1
2025.01   63.5    52.4
2025.01   56.5    41.8
2025.01   51.2    -
2025.01   48.8    52.9
2025.01   43.9    39.9
2025.01   39.8    35.3
2025.01   38.6    34.6
2025.01   31.7    29.4