Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

LayoutNet: Reconstructing the 3D Room Layout from a Single RGB Image

About

We propose an algorithm to predict room layout from a single image that generalizes across panoramas and perspective images, cuboid layouts and more general layouts (e.g. L-shape room). Our method operates directly on the panoramic image, rather than decomposing into perspective images as do recent works. Our network architecture is similar to that of RoomNet, but we show improvements due to aligning the image based on vanishing points, predicting multiple layout elements (corners, boundaries, size and translation), and fitting a constrained Manhattan layout to the resulting predictions. Our method compares well in speed and accuracy to other existing work on panoramas, achieves among the best accuracy for perspective images, and can handle both cuboid-shaped and more general Manhattan layouts.

Chuhang Zou, Alex Colburn, Qi Shan, Derek Hoiem• 2018

Related benchmarks

TaskDatasetResultRank
Cuboid layout estimationPanoContext (test)
3D IoU75.12
68
Cuboid layout estimationStanford 2D-3D (test)
3D IoU77.51
49
Room Layout EstimationMatterport3D official (test)
Overall 2D IoU0.7873
11
360° layout estimationStanford2D3D (test)
3D IoU76.33
11
Cuboid layout estimationStanford-2D3D
3D IoU77.51
7
Room Layout EstimationLSUN layout Challenge 9
Keypoint Error7.63
6
Room Layout EstimationHedau 11 (test)
Pixel Error9.69
5
Room Layout EstimationRealtor360 4 corners 1.0 (test)
2D IoU80.41
5
360° layout estimationRealtor360 (test)
Overall 2D IoU65.84
5
Room Layout EstimationRealtor360 1.0 (test)
2D IoU65.84
5
Showing 10 of 16 rows

Other info

Code

Follow for update