Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

ToolAlpaca

Benchmarks

Task NameDataset NameSOTA ResultTrend
Tool-use InferenceToolAlpaca
Match Rate5.26
22
Tool selectionToolAlpaca
Accuracy97.42
20
Tool usage simulationToolAlpaca evaluation
Procedure Score78.38
12
Showing 3 of 3 rows