Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Puzzle Solving on Sudoku In Distribution
Loading...
97.1
Average Score @128
Markov
-3.884
22.333
48.55
74.767
Mar 20, 2026
Average Score @128
Pass Rate @128
Updated 26d ago
Evaluation Results
Method
Method
Links
Average Score @128
Pass Rate @128
Markov
Model=Qwen3-4B, Traini...
2026.03
97.1
98
Action-sequence
Model=Qwen3-4B, Traini...
2026.03
93.5
97
State-action-sequence
Model=Qwen3-4B, Traini...
2026.03
91.1
96
Markov
Model=Qwen2.5-3B-It, T...
2026.03
86
94
State-action-sequence
Model=Qwen2.5-3B-It, T...
2026.03
83
90
Markov
Model=Qwen3-4B, Traini...
2026.03
34.2
100
State-action-sequence
Model=Qwen2.5-3B-It, T...
2026.03
22.4
100
Markov
Model=Qwen2.5-3B-It, T...
2026.03
20
99
Action-sequence
Model=Qwen3-4B, Traini...
2026.03
16.1
98
State-action-sequence
Model=Qwen3-4B, Traini...
2026.03
8.6
96
Action-sequence
Model=Qwen2.5-3B-It, T...
2026.03
0.3
23
Action-sequence
Model=Qwen2.5-3B-It, T...
2026.03
0
0
Feedback
Search any
task
Search any
task