Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Question Answering on HotpotQA (test) (EM/F1)
Loading...
68.75
EM
Graph-R1 + EKA
16.75
30.25
43.75
57.25
Dec 23, 2025
EM
F1
Updated 4d ago
Evaluation Results
Method
Method
Links
EM
F1
Graph-R1 + EKA
Backbone=Qwen2.5-14B-I...
2025.12
68.75
74.47
StandardRAG
Backbone=GPT-4o-mini
2025.12
35.16
46.7
GraphRAG
Backbone=GPT-4o-mini
2025.12
19.53
31.67
NaiveGeneration
Backbone=GPT-4o-mini
2025.12
18.75
31.79
LightRAG
Backbone=GPT-4o-mini
2025.12
18.75
30.7
Feedback
Search any
task
Search any
task