Share your thoughts, 1 month free Claude Pro on usSee more

Multi-hop Question Answering on Dragonball DragSingleZh (test)

100Recall

Qwen3-32B

Updated 4mo ago

Evaluation Results

Method	Links
Qwen3-32B 2026.01		100	0.01
Gemini-2.5-Flash 2026.01		100	0.01
Gemini-2.5-Pro 2026.01		100	0.01
FABLE(docs) 2026.01		74.99	0.4
FABLE(llm-docs) 2026.01		74.61	5.19
FABLE(nodes) 2026.01		72.97	3.33
FABLE(llm-nodes) 2026.01		66.07	21.9
TreeRAG 2026.01		57.31	18.38
HippoRAG2 2026.01		28.08	5.48
LongRefiner 2026.01		15.79	1.3