
Tool usage simulation on ToolAlpaca evaluation

78.38 Procedure Score

ToolCoder

[Chart: Procedure Score over time, Dec 7, 2023 to Feb 17, 2025]
Updated 4d ago

Evaluation Results

Date      Procedure   Response   Overall
2025.02   78.38       75.68      72.97
2023.12   77          85         75
2025.02   76.46       68.11      67.94
2023.12   75          78         74
2023.12   70          73         70
2023.12   69          73         65
2025.02   68.92       58.11      56.94
2025.02   68.06       58.33      57.92
2025.02   64.86       60.81      54.05
2023.12   63          69         60
2023.12   19          21         17
2023.12   17          31         16