Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

TRASE: Tracking-free 4D Segmentation and Editing

About

Understanding dynamic 3D scenes is crucial for extended reality (XR) and autonomous driving. Incorporating semantic information into 3D reconstruction enables holistic scene representations, unlocking immersive and interactive applications. To this end, we introduce TRASE, a novel tracking-free 4D segmentation method for dynamic scene understanding. TRASE learns a 4D segmentation feature field in a weakly-supervised manner, leveraging a soft-mined contrastive learning objective guided by SAM masks. The resulting feature space is semantically coherent and well-separated, and final object-level segmentation is obtained via unsupervised clustering. This enables fast editing, such as object removal, composition, and style transfer, by directly manipulating the scene's Gaussians. We evaluate TRASE on five dynamic benchmarks, demonstrating state-of-the-art segmentation performance from unseen viewpoints and its effectiveness across various interactive editing tasks. Our project page is available at: https://yunjinli.github.io/project-sadg/

Yun-Jin Li, Mariia Gladkova, Yan Xia, Daniel Cremers• 2024

Related benchmarks

TaskDatasetResultRank
4D Gaussian Instance SegmentationHyperNeRF
Time (min)63.05
6
4D Gaussian Instance SegmentationNeu3D
Time (min)93.48
6
4D Scene SegmentationMulti-Human
mIoU0.696
5
4D Scene SegmentationSelfCap
mIoU72.3
5
4D Scene SegmentationNeural3DV
mIoU64.9
5
4D Gaussian Instance SegmentationNeu3D (test)
coffee_martini mIoU0.9147
3
Instance SegmentationHyperNeRF
Americano mIoU78.64
3
Showing 7 of 7 rows

Other info

Follow for update