Trace3D: Consistent Segmentation Lifting via Gaussian Instance Tracing
About
We address the challenge of lifting 2D visual segmentation to 3D in Gaussian Splatting. Existing methods often suffer from inconsistent 2D masks across viewpoints and produce noisy segmentation boundaries as they neglect these semantic cues to refine the learned Gaussians. To overcome this, we introduce Gaussian Instance Tracing (GIT), which augments the standard Gaussian representation with an instance weight matrix across input views. Leveraging the inherent consistency of Gaussians in 3D, we use this matrix to identify and correct 2D segmentation inconsistencies. Furthermore, since each Gaussian ideally corresponds to a single object, we propose a GIT-guided adaptive density control mechanism to split and prune ambiguous Gaussians during training, resulting in sharper and more coherent 2D and 3D segmentation boundaries. Experimental results show that our method extracts clean 3D assets and consistently improves 3D segmentation in both online (e.g., self-prompting) and offline (e.g., contrastive lifting) settings, enabling applications such as hierarchical segmentation, object extraction, and scene editing.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Novel-view Panoptic Segmentation | Neu3D flame salmon | mAcc (Pixel)86.25 | 5 | |
| Novel-view Panoptic Segmentation | Neu3D sear steak | mAcc (Pixel)86.99 | 5 | |
| Novel-view Panoptic Segmentation | Neu3D coffee martini | mAcc (Pixel)82.05 | 5 | |
| Novel-view Panoptic Segmentation | Neu3D cook spinach | mAcc (Pixel)83.84 | 5 | |
| Novel-view Panoptic Segmentation | Neu3D cut roasted beef | Pixel Accuracy (mAcc-pix)62.59 | 5 | |
| Novel-view Panoptic Segmentation | Neu3D flame steak | Pixel Acc73.62 | 5 | |
| Panoptic Segmentation | HyperNeRF americano | Pixel Accuracy95.81 | 5 | |
| Panoptic Segmentation | HyperNeRF split-cookie | mAcc (pix)93.79 | 5 | |
| Panoptic Segmentation | HyperNeRF chickchicken | Pixel Accuracy (mAcc)92.29 | 5 | |
| Panoptic Segmentation | HyperNeRF torchocolate | mAcc (Pixel)87.54 | 5 |