Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Word Puzzle Solving on Crosswords Letter
Loading...
23.4
Accuracy
RouteGoT
5.512
10.156
14.8
19.444
Mar 6, 2026
Accuracy
Average Input Tokens
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy
Average Input Tokens
RouteGoT
Model Pool={Qwen3-4B,...
2026.03
23.4
4,222
GoT*
Backbone=Qwen3-30B
2026.03
22.4
13,597
CoT
Backbone=Qwen3-30B
2026.03
22.2
182
IO
Backbone=Qwen3-30B
2026.03
22
179
EmbedLLM
Model Pool={Qwen3-4B,...
2026.03
19
4,846
RouteLLM
Model Pool={Qwen3-4B,...
2026.03
12.8
5,110
ToT
Backbone=Qwen3-30B
2026.03
12.4
790
AGoT
Backbone=Qwen3-30B
2026.03
11
9,522
RTR
Model Pool={Qwen3-4B,...
2026.03
9.4
3,777
Random
Model Pool={Qwen3-4B,...
2026.03
7.6
8,767
KNN
Model Pool={Qwen3-4B,...
2026.03
6.2
5,044
Feedback
Search any
task
Search any
task