LGT-Net: Indoor Panoramic Room Layout Estimation with Geometry-Aware Transformer Network
About
3D room layout estimation by a single panorama using deep neural networks has made great progress. However, previous approaches can not obtain efficient geometry awareness of room layout with the only latitude of boundaries or horizon-depth. We present that using horizon-depth along with room height can obtain omnidirectional-geometry awareness of room layout in both horizontal and vertical directions. In addition, we propose a planar-geometry aware loss function with normals and gradients of normals to supervise the planeness of walls and turning of corners. We propose an efficient network, LGT-Net, for room layout estimation, which contains a novel Transformer architecture called SWG-Transformer to model geometry relations. SWG-Transformer consists of (Shifted) Window Blocks and Global Blocks to combine the local and global geometry relations. Moreover, we design a novel relative position embedding of Transformer to enhance the spatial identification ability for the panorama. Experiments show that the proposed LGT-Net achieves better performance than current state-of-the-arts (SOTA) on benchmark datasets.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Cuboid layout estimation | PanoContext (test) | 3D IoU87.53 | 68 | |
| Cuboid layout estimation | Stanford 2D-3D (test) | 3D IoU86.03 | 49 | |
| Room Layout Estimation | MatterportLayout (test) | 2D IoU84.61 | 28 | |
| 360° layout estimation | Stanford2D3D (test) | 3D IoU85.76 | 11 | |
| Room Layout Estimation | ZInD | 2DIoU (%)91.77 | 9 | |
| Room Layout Estimation | ZInd (test) | 2D IoU (%)92.39 | 9 | |
| General Layout Estimation | MatterportLayout | 2DIoU0.8352 | 8 | |
| Cuboid layout estimation | Stanford 2D-3D raw (test) | 3D IoU86.03 | 6 | |
| Cuboid layout estimation | PanoContext raw (test) | 3D IoU0.8516 | 5 | |
| Room Layout Estimation | ZInD Simple | 2D IoU53.2 | 5 |