Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Panoptic Neural Fields: A Semantic Object-Aware Neural Scene Representation

About

We present Panoptic Neural Fields (PNF), an object-aware neural scene representation that decomposes a scene into a set of objects (things) and background (stuff). Each object is represented by an oriented 3D bounding box and a multi-layer perceptron (MLP) that takes position, direction, and time and outputs density and radiance. The background stuff is represented by a similar MLP that additionally outputs semantic labels. Each object MLPs are instance-specific and thus can be smaller and faster than previous object-aware approaches, while still leveraging category-specific priors incorporated via meta-learned initialization. Our model builds a panoptic radiance field representation of any scene from just color images. We use off-the-shelf algorithms to predict camera poses, object tracks, and 2D image semantic segmentations. Then we jointly optimize the MLP weights and bounding box parameters using analysis-by-synthesis with self-supervision from color images and pseudo-supervision from predicted semantic segmentations. During experiments with real-world dynamic scenes, we find that our model can be used effectively for several tasks like novel view synthesis, 2D panoptic segmentation, 3D scene editing, and multiview depth prediction.

Abhijit Kundu, Kyle Genova, Xiaoqi Yin, Alireza Fathi, Caroline Pantofaru, Leonidas Guibas, Andrea Tagliasacchi, Frank Dellaert, Thomas Funkhouser• 2022

Related benchmarks

TaskDatasetResultRank
Semantic segmentationKITTI-360
mIoU74.28
30
Semantic View Synthesis (Novel View)ScanNet V2 (val)
mIoU94.1
12
Driving Scene ReconstructionKITTI-360
PSNR23.06
10
3D Instance SegmentationScanNet
PQscene54.3
7
3D Instance SegmentationReplica3D
PQ Scene52.5
7
Novel View SynthesisKITTI-360
Appearance PSNR21.91
6
Image ReconstructionKITTI
PSNR27.48
6
Novel View Semantic SynthesisKITTI-360 novel view synthesis
Semantic mIoU0.7428
6
Semantic segmentationHyperSim v1 (test)
mIoU58.7
6
Semantic segmentationScanNet v1 (test)
mIoU58.7
6
Showing 10 of 23 rows

Other info

Follow for update