Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

DuLa-Net: A Dual-Projection Network for Estimating Room Layouts from a Single RGB Panorama

About

We present a deep learning framework, called DuLa-Net, to predict Manhattan-world 3D room layouts from a single RGB panorama. To achieve better prediction accuracy, our method leverages two projections of the panorama at once, namely the equirectangular panorama-view and the perspective ceiling-view, that each contains different clues about the room layouts. Our network architecture consists of two encoder-decoder branches for analyzing each of the two views. In addition, a novel feature fusion structure is proposed to connect the two branches, which are then jointly trained to predict the 2D floor plans and layout heights. To learn more complex room layouts, we introduce the Realtor360 dataset that contains panoramas of Manhattan-world room layouts with different numbers of corners. Experimental results show that our work outperforms recent state-of-the-art in prediction accuracy and performance, especially in the rooms with non-cuboid layouts.

Shang-Ta Yang, Fu-En Wang, Chi-Han Peng, Peter Wonka, Min Sun, Hung-Kuo Chu• 2018

Related benchmarks

TaskDatasetResultRank
Cuboid layout estimationPanoContext (test)
3D IoU77.42
68
Cuboid layout estimationStanford 2D-3D (test)
3D IoU79.63
49
Room Layout EstimationMatterportLayout (test)
2D IoU78.82
28
Room Layout EstimationMatterport3D official (test)
Overall 2D IoU0.7882
11
360° layout estimationStanford2D3D (test)
3D IoU79.36
11
Cuboid layout estimationStanford-2D3D
3D IoU79.36
7
3D Structure EstimationStanford3D
2D IoU (All)75.07
6
Room Layout EstimationRealtor360 1.0 (test)
2D IoU80.53
5
Room Layout EstimationRealtor360 4 corners 1.0 (test)
2D IoU82.63
5
Room Layout EstimationRealtor360 6 corners 1.0 (test)
2D IoU80.72
5
Showing 10 of 15 rows

Other info

Follow for update