Multi-Hop Question Answering on 2WikiMultihopQA (F1/EM)
[Chart: F1 Score / EM Score. Top F1 Score: 70.33 (HippoRAG), Apr 21, 2026]
Updated 14d ago
Evaluation Results
Method      Details       Date      F1 Score   EM Score
HippoRAG    LLM=GPT-4o    2026.04   70.33      61.16
QAFD-RAG    LLM=GPT-4o    2026.04   69.41      59.5
RAPTOR      LLM=GPT-4o    2026.04   38.8       12
GraphRAG    LLM=GPT-4o    2026.04   15.2       7
LightRAG    LLM=GPT-4o    2026.04   8.2        1