Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Multi-hop Question Answering on Dragonball DragSingleZh (test)
Loading...
100
Recall
Qwen3-32B
12.4216
35.1583
57.895
80.6317
Jan 26, 2026
Recall
EIR
Updated 4d ago
Evaluation Results
Method
Method
Links
Recall
EIR
Qwen3-32B
category=LLM-in-contex...
2026.01
100
0.01
Gemini-2.5-Flash
category=LLM-in-contex...
2026.01
100
0.01
Gemini-2.5-Pro
category=LLM-in-contex...
2026.01
100
0.01
FABLE(docs)
selection_strategy=LLM...
2026.01
74.99
0.4
FABLE(llm-docs)
selection_strategy=LLM...
2026.01
74.61
5.19
FABLE(nodes)
selection_strategy=LLM...
2026.01
72.97
3.33
FABLE(llm-nodes)
selection_strategy=LLM...
2026.01
66.07
21.9
TreeRAG
category=Structure-Enh...
2026.01
57.31
18.38
HippoRAG2
category=Structure-Enh...
2026.01
28.08
5.48
LongRefiner
category=Structure-Enh...
2026.01
15.79
1.3
Feedback
Search any
task
Search any
task