Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Training-Free Hierarchical Scene Understanding for Gaussian Splatting with Superpoint Graphs

About

Bridging natural language and 3D geometry is a crucial step toward flexible, language-driven scene understanding. While recent advances in 3D Gaussian Splatting (3DGS) have enabled fast and high-quality scene reconstruction, research has also explored incorporating open-vocabulary understanding into 3DGS. However, most existing methods require iterative optimization over per-view 2D semantic feature maps, which not only results in inefficiencies but also leads to inconsistent 3D semantics across views. To address these limitations, we introduce a training-free framework that constructs a superpoint graph directly from Gaussian primitives. The superpoint graph partitions the scene into spatially compact and semantically coherent regions, forming view-consistent 3D entities and providing a structured foundation for open-vocabulary understanding. Based on the graph structure, we design an efficient reprojection strategy that lifts 2D semantic features onto the superpoints, avoiding costly multi-view iterative training. The resulting representation ensures strong 3D semantic coherence and naturally supports hierarchical understanding, enabling both coarse- and fine-grained open-vocabulary perception within a unified semantic field. Extensive experiments demonstrate that our method achieves state-of-the-art open-vocabulary segmentation performance, with semantic field reconstruction completed over $30\times$ faster. Our code will be available at https://github.com/Atrovast/THGS.

Shaohui Dai, Yansong Qu, Zheyan Li, Xinyang Li, Shengchuan Zhang, Liujuan Cao• 2025

Related benchmarks

TaskDatasetResultRank
3D Semantic SegmentationScanNet 10 classes
mIoU46.38
17
3D Semantic SegmentationScanNet 15 classes
mIoU39.61
17
Semantic segmentationScanNet 19 classes
mIoU34.39
13
Open Vocabulary Semantic SegmentationLERF-OVS
mIoU54.9
12
Relationship-Guided 3D Instance SegmentationScanNet++
mIoU29
7
Open-Vocabulary Segmentation3D-OVS corrected (test)
mIoU (Bed)68.9
5
Showing 6 of 6 rows

Other info

Follow for update