
Tool-use evaluation on MCPToolBench++

81.8 Precision

Llama-3.3-70B

[Chart: score over time, Apr 11, 2025]
Updated 1mo ago

Evaluation Results

Method  Links
2025.04
81.8  82.7  82.1  81
77.5  80    78.3  75.3
2025.04
77.3  80.7  78.4  74
75    76.3  75.3  74.3
2025.04
72.5  75.3  73.4  69.7
2025.04
67    68.3  67.4  65.7
2025.04
62    65.3  63.1  58.7