HorizonNet: Learning Room Layout with 1D Representation and Pano Stretch Data Augmentation
About
We present a new approach to the problem of estimating the 3D room layout from a single panoramic image. We represent room layout as three 1D vectors that encode, at each image column, the boundary positions of floor-wall and ceiling-wall, and the existence of wall-wall boundary. The proposed network, HorizonNet, trained for predicting 1D layout, outperforms previous state-of-the-art approaches. The designed post-processing procedure for recovering 3D room layouts from 1D predictions can automatically infer the room shape with low computation cost - it takes less than 20ms for a panorama image while prior works might need dozens of seconds. We also propose Pano Stretch Data Augmentation, which can diversify panorama data and be applied to other panorama-related learning tasks. Due to the limited data available for non-cuboid layout, we relabel 65 general layout from the current dataset for finetuning. Our approach shows good performance on general layouts by qualitative results and cross-validation.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Cuboid layout estimation | PanoContext (test) | 3D IoU84.23 | 68 | |
| Cuboid layout estimation | Stanford 2D-3D (test) | 3D IoU82.79 | 49 | |
| Room Layout Estimation | MatterportLayout (test) | 2D IoU82.85 | 28 | |
| Room Layout Estimation | Matterport3D official (test) | Overall 2D IoU0.8124 | 11 | |
| 360° layout estimation | Stanford2D3D (test) | 3D IoU83.51 | 11 | |
| Non-cuboid layout estimation | MatterportLayout non-cuboid (test) | 3D IoU79.12 | 10 | |
| Room Layout Estimation | ZInD | 2DIoU (%)90.44 | 9 | |
| Room Layout Estimation | MP3D-FPE (test) | 2D IoU65.38 | 9 | |
| Room Layout Estimation | ZInd (test) | 2D IoU (%)91.37 | 9 | |
| 360° layout estimation | Matterport3D | 2D IoU81.24 | 8 |