
Tool Learning on StableToolBench I2-Inst.

73.4 SoPR

GPT-4 (Parallel)

[Chart: y-axis ticks 21.088, 34.669, 48.25, 61.831; x-axis date Jan 21, 2025]
Updated 4d ago

Evaluation Results

Date      SoPR   Second score (unlabeled)
2025.01   73.4   85.8
2025.01   70.8   73.6
2025.01   62.1   70.8
2025.01   60.2   55.6
2025.01   57.1   72.6
2025.01   54.9   55.7
2025.01   50.6   69.8
2025.01   49.7   53.8
2025.01   39.9   49.1
2025.01   37.6   -
2025.01   37.5   45.6
2025.01   36     47.2
2025.01   23.1   32.1