Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Open-ended Question Answering on MoreHopQA (test)
Loading...
77
Accuracy
RouteGoT
23.96
37.73
51.5
65.27
Mar 6, 2026
Accuracy
Average Output Tokens
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy
Average Output Tokens
RouteGoT
Backbone={Qwen3-4B, 8B...
2026.03
77
665
GoT*
Backbone=Qwen3-30B
2026.03
73
814
RTR
Backbone={Qwen3-4B, 8B...
2026.03
73
626
KNN
Backbone={Qwen3-4B, 8B...
2026.03
71
624
RouteLLM
Backbone={Qwen3-4B, 8B...
2026.03
71
688
AGoT
Backbone=Qwen3-30B
2026.03
70
2,064
CoT
Backbone=Qwen3-30B
2026.03
68
727
EmbedLLM
Backbone={Qwen3-4B, 8B...
2026.03
65
2,621
Random
Backbone={Qwen3-4B, 8B...
2026.03
64
1,516
ToT
Backbone=Qwen3-30B
2026.03
61
3,860
IO
Backbone=Qwen3-30B
2026.03
26
9
Feedback
Search any
task
Search any
task