Learning 3D Semantic Scene Graphs from 3D Indoor Reconstructions

About

Scene understanding has been of high interest in computer vision. It encompasses not only identifying objects in a scene, but also their relationships within the given context. With this goal, a recent line of works tackles 3D semantic segmentation and scene layout prediction. In our work we focus on scene graphs, a data structure that organizes the entities of a scene in a graph, where objects are nodes and their relationships modeled as edges. We leverage inference on scene graphs as a way to carry out 3D scene understanding, mapping objects and their relationships. In particular, we propose a learned method that regresses a scene graph from the point cloud of a scene. Our novel architecture is based on PointNet and Graph Convolutional Networks (GCN). In addition, we introduce 3DSSG, a semi-automatically generated dataset, that contains semantically rich scene graphs of 3D scenes. We show the application of our method in a domain-agnostic retrieval task, where graphs serve as an intermediate representation for 3D-3D and 2D-3D matching.

Johanna Wald, Helisa Dhamo, Nassir Navab, Federico Tombari• 2020

Related benchmarks

Task	Dataset	Result
Predicate Classification (PredCls)	3DSSG	mR@5041.52	26
Predicate Classification (PredCls)	3DSSG (val)	Recall@2054.5	24
Scene Graph Classification (SGCls)	3DSSG (val)	Recall@2028.2	24
Scene Graph Classification (SGCls)	3DSSG	mR@200.197	22
3D scene graph generation	MA3DSG-Bench SCP setting 1.0 (test)	Triplet Recall@118.6	20
Triplet Prediction	3DSSG	mA@5052.74	15
Relationship Prediction	3RScan 3DSSG Geometric Segments 1.0 (test)	Recall@183	14
Panoptic video scene graph generation	PSG4D-GTA	R@202.29	12
Panoptic video scene graph generation	PSG4D HOI	Recall@200.0423	12
Predicate Detection	3RScan	R@394	10

Showing 10 of 34 rows

Other info

Follow for update

@wizwand_team Discord