Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Multi-hop Reasoning on QASPER
Loading...
15
EM
MergeRAG-Sym
7.2
9.225
11.25
13.275
Mar 18, 2026
EM
F1 Score
Accuracy
Updated 25d ago
Evaluation Results
Method
Method
Links
EM
F1 Score
Accuracy
MergeRAG-Sym
2026.03
15
38.4
17.5
MergeRAG-Asym
2026.03
12.5
36.2
18
RECOMP
2026.03
11.5
26.3
14
RAPTOR
2026.03
11.5
30.9
13.5
BGE-reranker
2026.03
10
31
13
BM25
2026.03
9.5
23
10.5
Tree-RAG
2026.03
7.5
17.5
8.1
Feedback
Search any
task
Search any
task