
Seal-Tools

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Tool Calling | Seal-Tools Single-Tool | Name Match Score: 98.14 | 30 |
| Tool Calling | Seal-Tools Single-Tool v1 (test) | F1 Name: 98.14 | 12 |
| Function Calling | Seal-Tools | F1 Score: 94.94 | 11 |
| Multi-tool calling | SEAL-Tools | TSA: 86.5 | 9 |
| Tool Retrieval | Seal-Tools (test) | Recall@5: 88.4 | 5 |
| Tool Selection | Seal-Tools (test) | Top-1 Acc: 99.9 | 2 |
| Agent-team recommendation | Seal-Tools (test) | Retrieval Success (Top-10): 99.6 | 2 |