In-Place Scene Labelling and Understanding with Implicit Scene Representation

About

Semantic labelling is highly correlated with geometry and radiance reconstruction, as scene entities with similar shape and appearance are more likely to come from similar classes. Recent implicit neural reconstruction techniques are appealing as they do not require prior training data, but the same fully self-supervised approach is not possible for semantics because labels are human-defined properties. We extend neural radiance fields (NeRF) to jointly encode semantics with appearance and geometry, so that complete and accurate 2D semantic labels can be achieved using a small amount of in-place annotations specific to the scene. The intrinsic multi-view consistency and smoothness of NeRF benefit semantics by enabling sparse labels to efficiently propagate. We show the benefit of this approach when labels are either sparse or very noisy in room-scale scenes. We demonstrate its advantageous properties in various interesting applications such as an efficient scene labelling tool, novel semantic view synthesis, label denoising, super-resolution, label interpolation and multi-view semantic label fusion in visual semantic mapping systems.
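One way to picture jointly encoding semantics with appearance and geometry is to composite per-sample class logits along each camera ray with the same weights that standard NeRF volume rendering uses for colour. The sketch below assumes that formulation. The abstract does not spell out the rendering or loss equations, so the function names (`render_ray`, `softmax`, `semantic_loss`), the logit compositing, and the rule that unlabelled pixels contribute no semantic loss are illustrative assumptions, not the authors' confirmed implementation.

```python
import math

def render_ray(sigmas, deltas, colors, sem_logits):
    """Composite colour and semantic logits along one ray (hypothetical helper).

    sigmas:     density at each of K samples along the ray
    deltas:     distance covered by each sample
    colors:     K RGB triples
    sem_logits: K lists of C per-class logits
    """
    num_classes = len(sem_logits[0])
    rgb = [0.0, 0.0, 0.0]
    logits = [0.0] * num_classes
    transmittance = 1.0  # fraction of light that reaches the current sample
    for sigma, delta, color, sem in zip(sigmas, deltas, colors, sem_logits):
        alpha = 1.0 - math.exp(-sigma * delta)  # opacity of this sample
        weight = transmittance * alpha          # shared by colour and semantics
        rgb = [acc + weight * ch for acc, ch in zip(rgb, color)]
        logits = [acc + weight * lg for acc, lg in zip(logits, sem)]
        transmittance *= 1.0 - alpha
    return rgb, logits

def softmax(xs):
    """Numerically stable softmax over a list of logits."""
    peak = max(xs)
    exps = [math.exp(x - peak) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def semantic_loss(logits, label):
    """Cross-entropy for one pixel; label=None marks an unlabelled pixel.

    Assumption: unlabelled pixels are skipped, so sparse annotations only
    supervise the rays that actually carry a label.
    """
    if label is None:
        return 0.0
    return -math.log(softmax(logits)[label])

# Toy ray: empty space, then a dense surface whose samples vote for class 1.
rgb, logits = render_ray(
    sigmas=[0.0, 50.0, 50.0],
    deltas=[0.1, 0.1, 0.1],
    colors=[[0.0, 0.0, 0.0], [1.0, 0.0, 0.0], [0.0, 0.0, 1.0]],
    sem_logits=[[0.0, 0.0], [0.0, 4.0], [4.0, 0.0]],
)
predicted_class = max(range(len(logits)), key=lambda c: logits[c])
```

Because colour and semantics share the same density-derived weights in this sketch, a label observed in one view constrains the same 3D region seen from other views, which is the kind of multi-view consistency the abstract credits for propagating sparse labels.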

Shuaifeng Zhi, Tristan Laidlow, Stefan Leutenegger, Andrew J. Davison • 2021

Related benchmarks

| Task | Dataset | Result | Rank |
|---|---|---|---|
| Semantic segmentation | ScanNet | -- | 59 |
| Novel View Synthesis | ScanNet | PSNR 28.43 | 58 |
| Semantic View Synthesis (Novel View) | ScanNet V2 (val) | mIoU 96.8 | 12 |
| Semantic segmentation | Replica synthetic (test) | Total Acc 94.36 | 9 |
| Semantic segmentation | ScanNet real (test) | Total Accuracy 97.54 | 9 |
| 3D Open-vocabulary Segmentation | LERF-style Dataset bench scene (test) | mIoU 94.2 | 8 |
| 3D Open-vocabulary Segmentation | LERF-style Dataset bed scene (test) | mIoU 89.3 | 8 |
| 3D Open-vocabulary Segmentation | LERF-style Dataset sofa scene (test) | mIoU 66.3 | 8 |
| 3D Open-vocabulary Segmentation | LERF-style Dataset lawn scene (test) | mIoU 87.6 | 8 |
| 3D Open-vocabulary Segmentation | LERF-style Dataset table scene (test) | mIoU 83.8 | 8 |
Showing 10 of 31 rows
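
For readers unfamiliar with the metrics named in the table, the sketch below shows the textbook definitions of PSNR, mean intersection-over-union (mIoU), and total pixel accuracy on flat lists of values. The function names are hypothetical. The benchmarks' exact evaluation protocol (which classes are ignored, how scores are averaged across images, how values are scaled) is not stated on this page, so treat this as an illustration of the definitions only.

```python
import math

def psnr(pred, target, max_val=1.0):
    """Peak signal-to-noise ratio in dB for intensities in [0, max_val]."""
    mse = sum((p - t) ** 2 for p, t in zip(pred, target)) / len(pred)
    if mse == 0.0:
        return math.inf  # identical images
    return 10.0 * math.log10(max_val ** 2 / mse)

def mean_iou(pred, target, num_classes):
    """Mean IoU over the classes that occur in the prediction or the target."""
    ious = []
    for c in range(num_classes):
        inter = sum(1 for p, t in zip(pred, target) if p == c and t == c)
        union = sum(1 for p, t in zip(pred, target) if p == c or t == c)
        if union > 0:  # assumption: classes absent from both are skipped
            ious.append(inter / union)
    if not ious:
        raise ValueError("no class occurs in prediction or target")
    return sum(ious) / len(ious)

def total_accuracy(pred, target):
    """Fraction of pixels whose predicted label matches the target label."""
    correct = sum(1 for p, t in zip(pred, target) if p == t)
    return correct / len(pred)

# Four labelled pixels, two classes.
pred_labels = [0, 0, 1, 1]
true_labels = [0, 1, 1, 1]
miou = mean_iou(pred_labels, true_labels, num_classes=2)
acc = total_accuracy(pred_labels, true_labels)
```

These functions return fractions in [0, 1]; multiply by 100 to express them as percentages.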
