| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| Sudoku 512 tokens | d-TreeRPO | Pass@180.3 | 15 | 4d ago | |
| Sudoku 256 tokens | d-TreeRPO | Pass@192.9 | 15 | 4d ago | |
| Sudoku 2x2 | Rainbow | Final Reward1.3 | 14 | 4d ago | |
| Sudoku (test) | wd1 | Accuracy76.4 | 12 | 2d ago | |
| Visual Sudoku | Neural (RRN) | Board Accuracy99.8 | 12 | 3d ago | |
| Symbolic Sudoku | Neural (RRN) | Board Accuracy1 | 12 | 3d ago | |
| Sudoku | GPT-OSS-120B | Success Rate (pass@1)100 | 10 | 4d ago | |
| Sudoku | dXPP | Average per-epoch runtime (ms)12.15 | 9 | 4d ago | |
| 9x9 Sudoku (test) | Fine-tuned (solver order) | Cell Accuracy52 | 7 | 4d ago | |
| Sudoku 5x5 | NSAM(ours) | Final Reward2.7 | 7 | 4d ago | |
| Sudoku 4x4 | NSAM(ours) | Final Reward2.1 | 7 | 4d ago | |
| Sudoku 3x3 | NSAM(ours) | Final Reward160 | 7 | 4d ago | |
| Sudoku (17-givens) | Recurrent Relational Network | Accuracy96.6 | 7 | 4d ago | |
| SatNet Easy (test) | RRN | Solve Rate100 | 5 | 4d ago | |
| RRN Sudoku (test) | Recurrent Relational Network | Complete Puzzle Accuracy96.6 | 4 | 4d ago | |
| Sudoku 24-36-givens | Accuracy70 | 1 | 4d ago |