Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Question Answering over Tables and Text on HybridQA
Loading...
91
Accuracy
ToT
63.96
70.98
78
85.02
Mar 6, 2026
Accuracy
Average Input Tokens
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy
Average Input Tokens
ToT
Backbone=Qwen3-30B
2026.03
91
12,355
RouteGoT
Model Pool={Qwen3-4B,...
2026.03
91
14,743
GoT*
Backbone=Qwen3-30B
2026.03
88
65,134
CoT
Backbone=Qwen3-30B
2026.03
84
10,239
AGoT
Backbone=Qwen3-30B
2026.03
84
112,953
Random
Model Pool={Qwen3-4B,...
2026.03
70
21,138
KNN
Model Pool={Qwen3-4B,...
2026.03
68
20,686
RTR
Model Pool={Qwen3-4B,...
2026.03
68
20,839
IO
Backbone=Qwen3-30B
2026.03
67
10,462
EmbedLLM
Model Pool={Qwen3-4B,...
2026.03
66
21,875
RouteLLM
Model Pool={Qwen3-4B,...
2026.03
65
20,652
Feedback
Search any
task
Search any
task