Share your thoughts, 1 month free Claude Pro on usSee more

End-to-End Question Answering on MCP-Bench

87.5Accuracy (Human)

TURA

Updated 4mo ago

Evaluation Results

Method	Links
TURA 2025.08		87.5	88.3	96.2	97.1
Tool-Agent 2025.08		76.8	80.4	81.7	83.9
Dynamic RAG 2025.08		67.2	69.5	77.6	79.4
LLM + RAG 2025.08		65.3	68.1	72.4	75