Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Rewis3d: Reconstruction Improves Weakly-Supervised Semantic Segmentation

About

We present Rewis3d, a framework that leverages recent advances in feed-forward 3D reconstruction to significantly improve weakly supervised semantic segmentation on 2D images. Obtaining dense, pixel-level annotations remains a costly bottleneck for training segmentation models. Alleviating this issue, sparse annotations offer an efficient weakly-supervised alternative. However, they still incur a performance gap. To address this, we introduce a novel approach that leverages 3D scene reconstruction as an auxiliary supervisory signal. Our key insight is that 3D geometric structure recovered from 2D videos provides strong cues that can propagate sparse annotations across entire scenes. Specifically, a dual student-teacher architecture enforces semantic consistency between 2D images and reconstructed 3D point clouds, using state-of-the-art feed-forward reconstruction to generate reliable geometric supervision. Extensive experiments demonstrate that Rewis3d achieves state-of-the-art performance in sparse supervision, outperforming existing approaches by 2-7% without requiring additional labels or inference overhead.

Jonas Ernst, Wolfgang Boettcher, Lukas Hoyer, Jan Eric Lenssen, Bernt Schiele• 2026

Related benchmarks

TaskDatasetResultRank
Semantic segmentationCityscapes
mIoU68.6
658
Semantic segmentationKITTI-360
mIoU63.4
36
Semantic segmentationWaymo Open Dataset
mIoU53.3
6
Semantic segmentationNYU V2
mIoU46.1
6
3D Semantic SegmentationWaymo
mIoU45.5
2
3D Semantic SegmentationKITTI-360
mIoU44.9
2
3D Semantic SegmentationNYU V2
mIoU28.5
2
Showing 7 of 7 rows

Other info

Follow for update