PuzLM: Solving Jigsaw Puzzles with Sequence-to-Sequence Language Models
About
Square jigsaw puzzles are typically solved by visually matching piece images to recover the original layout. This work introduces PuzLM, an alternative perspective that recasts jigsaw reassembly as a discrete sequence-to-sequence (Seq2Seq) problem, inspired by natural language representations. We design an efficient puzzle quantization procedure that transforms each piece into a short sequence of discrete tokens, enabling the direct application of standard Seq2Seq language models as powerful jigsaw solvers. Our approach demonstrates that accurate puzzle reconstruction can be achieved through purely symbolic reasoning over discrete representations, improving state-of-the-art performance even on puzzles with eroded boundaries or missing pieces.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Jigsaw puzzle solving | JPwLEG-5 | Absolute Score72.1 | 9 | |
| Jigsaw puzzle solving | JPwLEG-3 | Abs. Score91.9 | 8 | |
| Jigsaw puzzle solving | ImageNet 3x3 (test) | Absolute Accuracy92.2 | 5 | |
| Jigsaw puzzle solving | ImageNet 3x3 (1/9 pieces missing) | Absolute Score86 | 2 | |
| Jigsaw puzzle solving | ImageNet 3x3 (2/9 pieces missing) | Absolute Accuracy73.8 | 2 | |
| Jigsaw puzzle solving | ImageNet 3x3 (3/9 pieces missing) | Absolute Score61.2 | 2 |