Raster2Seq: Polygon Sequence Generation for Floorplan Reconstruction
About
Reconstructing a structured vector-graphics representation from a rasterized floorplan image is typically an important prerequisite for computational tasks involving floorplans such as automated understanding or CAD workflows. However, existing techniques struggle in faithfully generating the structure and semantics conveyed by complex floorplans that depict large indoor spaces with many rooms and a varying numbers of polygon corners. To this end, we propose Raster2Seq, framing floorplan reconstruction as a sequence-to-sequence task in which floorplan elements--such as rooms, windows, and doors--are represented as labeled polygon sequences that jointly encode geometry and semantics. Our approach introduces an autoregressive decoder that learns to predict the next corner conditioned on image features and previously generated corners using guidance from learnable anchors. These anchors represent spatial coordinates in image space, hence allowing for effectively directing the attention mechanism to focus on informative image regions. By embracing the autoregressive mechanism, our method offers flexibility in the output format, enabling for efficiently handling complex floorplans with numerous rooms and diverse polygon structures. Our method achieves state-of-the-art performance on standard benchmarks such as Structure3D, CubiCasa5K, and Raster2Graph, while also demonstrating strong generalization to more challenging datasets like WAFFLE, which contain diverse room structures and complex geometric variations.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Floorplan Reconstruction | Structured3D density map input (test) | Room Precision99.4 | 11 | |
| Floorplan Reconstruction | Structured3D binary (test) | Room F199.6 | 10 | |
| Floorplan Reconstruction | Raster2Graph | Room F197 | 6 | |
| Floorplan Reconstruction | CubiCasa5K | Room F188.7 | 5 | |
| Geometric Floorplan Reconstruction | Raster2Graph 16 (test) | Room Precision0.972 | 5 | |
| Floorplan Reconstruction Efficiency | Raster2Graph | Sampling time (s)0.52 | 5 | |
| Floorplan interior segmentation | WAFFLE (test) | IoU73.9 | 4 | |
| Floorplan Reconstruction | CubiCasa5K (test) | Room Precision89.3 | 4 | |
| Semantic Floorplan Parsing | Raster2Graph 16 (test) | Room Semantic Precision85.3 | 3 | |
| Floorplan Semantic Segmentation | Structured3D 50 (test) | Room Semantic Prec.76.8 | 2 |