Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

TRASE: Tracking-free 4D Segmentation and Editing

About

Understanding dynamic 3D scenes is crucial for extended reality (XR) and autonomous driving. Incorporating semantic information into 3D reconstruction enables holistic scene representations, unlocking immersive and interactive applications. To this end, we introduce TRASE, a novel tracking-free 4D segmentation method for dynamic scene understanding. TRASE learns a 4D segmentation feature field in a weakly-supervised manner, leveraging a soft-mined contrastive learning objective guided by SAM masks. The resulting feature space is semantically coherent and well-separated, and final object-level segmentation is obtained via unsupervised clustering. This enables fast editing, such as object removal, composition, and style transfer, by directly manipulating the scene's Gaussians. We evaluate TRASE on five dynamic benchmarks, demonstrating state-of-the-art segmentation performance from unseen viewpoints and its effectiveness across various interactive editing tasks. Our project page is available at: https://yunjinli.github.io/project-sadg/

Yun-Jin Li, Mariia Gladkova, Yan Xia, Daniel Cremers• 2024

Related benchmarks

TaskDatasetResultRank
4D Scene SegmentationNeural3DV
mIoU (coffee_martini)91.2
8
4D Gaussian Instance SegmentationHyperNeRF
Time (min)63.05
6
4D Gaussian Instance SegmentationNeu3D
Time (min)93.48
6
4D Scene SegmentationMulti-Human
mIoU0.696
5
Photometric RenderingNeural3DV
PSNR29.29
5
4D Scene SegmentationSelfCap
mIoU72.3
5
Photometric RenderingPanoptic Studio
PSNR26.1
5
Novel View SynthesisST-NeRF basketball
PSNR33.421
4
4D Gaussian Instance SegmentationNeu3D (test)
coffee_martini mIoU0.9147
3
Instance SegmentationPanoptic Studio
Basketball mIoU57.16
3
Showing 10 of 15 rows

Other info

Follow for update