Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Open-Vocabulary Functional 3D Scene Graphs for Real-World Indoor Spaces

About

We introduce the task of predicting functional 3D scene graphs for real-world indoor environments from posed RGB-D images. Unlike traditional 3D scene graphs that focus on spatial relationships of objects, functional 3D scene graphs capture objects, interactive elements, and their functional relationships. Due to the lack of training data, we leverage foundation models, including visual language models (VLMs) and large language models (LLMs), to encode functional knowledge. We evaluate our approach on an extended SceneFun3D dataset and a newly collected dataset, FunGraph3D, both annotated with functional 3D scene graphs. Our method significantly outperforms adapted baselines, including Open3DSG and ConceptGraph, demonstrating its effectiveness in modeling complex scene functionalities. We also demonstrate downstream applications such as 3D question answering and robotic manipulation using functional 3D scene graphs. See our project page at https://openfungraph.github.io

Chenyangguang Zhang, Alexandros Delitzas, Fangjinhua Wang, Ruida Zhang, Xiangyang Ji, Marc Pollefeys, Francis Engelmann• 2025

Related benchmarks

TaskDatasetResultRank
3D Object LocalizationGOAT-Core Scene Tee
Average SR@573.3
6
3D Object LocalizationGOAT-Core Scene 4ok
Average SR@577.5
6
3D Object LocalizationGOAT-Core Scene 5cd
Average SR@557.5
6
3D Object LocalizationGOAT-Core (Total)
Average SR@563.1
6
3D Object LocalizationGOAT-Core Scene Nfv
Average SR@544.2
6
Scene ReconstructionSceneFun3D
Objects R@381.8
5
Scene ReconstructionFunGraph3D
Objects R@370.7
5
Functional Scene Graph ConstructionBehavior-1k Simulation
Recall39.7
4
Functional Scene Graph ConstructionReal-world
Recall45.7
4
3D functional elements segmentationFunGraph3D
R@3 (IoU > 0.0)45.34
3
Showing 10 of 13 rows

Other info

Follow for update