Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Sudoku

Benchmarks

Task NameDataset NameSOTA ResultTrend
PlanningSudoku
Accuracy92.7
68
Logical ReasoningSudoku
Accuracy89.2
44
SudokuSudoku 4x4 (test)
Accuracy (Seq Len 64)17.5
18
Agent TaskSudoku
Success Rate (SR)99
17
Agent Behavior AdaptationSudoku (Su) (test)
Loop Ratio34.3
17
Sudoku SolvingSudoku 512 tokens
Pass@180.3
15
Sudoku SolvingSudoku 256 tokens
Pass@192.9
15
Sudoku SolvingSudoku 2x2
Final Reward1.3
14
Sudoku SolvingSudoku (test)
Accuracy76.4
12
Constraint SatisfactionSudoku
CSP Result Index 3557
12
Sudoku SolvingSudoku
Success Rate (pass@1)100
10
Sequential puzzle-solvingSudoku
Accuracy44.2
9
Sudoku SolvingSudoku
Average per-epoch runtime (ms)12.15
9
ReasoningSudoku (test)
Accuracy0.161
9
Sudoku Solving9x9 Sudoku (test)
Cell Accuracy52
7
Sudoku SolvingSudoku 5x5
Final Reward2.7
7
Sudoku SolvingSudoku 4x4
Final Reward2.1
7
Sudoku SolvingSudoku 3x3
Final Reward160
7
Sudoku SolvingSudoku (17-givens)
Accuracy96.6
7
Symbolic planning4x4 Sudoku
Accuracy (Ngen=128)26.6
6
ReasoningSudoku
Avg Diffusion Steps38.3
6
Logical ReasoningSudoku 9x9
Accuracy0.11
5
Sudoku SolvingSudoku 24-36-givens
Accuracy70
1
Showing 23 of 23 rows