Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Multi-hop Question Answering on MuSiQue (% Cost Saving)
Loading...
159.2
Cost Saving
Adaptive-RAG (Extended)
-6.368
36.616
79.6
122.584
May 26, 2026
Cost Saving
Updated 7d ago
Evaluation Results
Method
Method
Links
Cost Saving
Adaptive-RAG (Extended)
Accuracy Goal=95%
2026.05
159.2
Adaptive-RAG (Extended)
Accuracy Goal=90%
2026.05
159.2
CARROT-RoBERTa (LLM-only)
Accuracy Goal=90%
2026.05
92.3
BRANE
Accuracy Goal=100%
2026.05
89.4
CARROT-KNN (Extended)
Accuracy Goal=95%
2026.05
77
CARROT-RoBERTa (Extended)
Accuracy Goal=95%
2026.05
59.2
CARROT-KNN (Extended)
Accuracy Goal=90%
2026.05
57.5
CARROT-RoBERTa (Extended)
Accuracy Goal=90%
2026.05
44.6
BRANE
Accuracy Goal=90%
2026.05
40.7
BRANE
Accuracy Goal=95%
2026.05
17.3
CARROT-RoBERTa (Extended)
Accuracy Goal=100%
2026.05
2.4
Most Accurate Static (Murakkab)
Accuracy Goal=100%
2026.05
0
Most Accurate Static (Murakkab)
Accuracy Goal=95%
2026.05
0
Most Accurate Static (Murakkab)
Accuracy Goal=90%
2026.05
0
Feedback
Search any
task
Search any
task