Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

GANSeg: Learning to Segment by Unsupervised Hierarchical Image Generation

About

Segmenting an image into its parts is a frequent preprocess for high-level vision tasks such as image editing. However, annotating masks for supervised training is expensive. Weakly-supervised and unsupervised methods exist, but they depend on the comparison of pairs of images, such as from multi-views, frames of videos, and image augmentation, which limits their applicability. To address this, we propose a GAN-based approach that generates images conditioned on latent masks, thereby alleviating full or weak annotations required in previous approaches. We show that such mask-conditioned image generation can be learned faithfully when conditioning the masks in a hierarchical manner on latent keypoints that define the position of parts explicitly. Without requiring supervision of masks or points, this strategy increases robustness to viewpoint and object positions changes. It also lets us generate image-mask pairs for training a segmentation network, which outperforms the state-of-the-art unsupervised segmentation methods on established benchmarks.

Xingzhe He, Bastian Wandt, Helge Rhodin• 2021

Related benchmarks

TaskDatasetResultRank
Unsupervised Object SegmentationCUB
Jaccard Index62.9
16
Landmark DetectionCelebA Wild (K=8) (test)
Normalized L2 Distance (%)6.18
14
Landmark DetectionCUB Category 001 2011 (test)
Normalized L2 Distance22.1
12
Landmark DetectionCUB Category 002 2011 (test)
Normalized L2 Distance22.3
12
Landmark DetectionCelebA Wild (K=4) (test)
Normalized L2 Distance12.26
10
Landmark DetectionCUB-003
Normalized L2 Distance0.215
9
Landmark DetectionCelebA Aligned (K=10) (test)
Norm L2 Dist (%)3.98
9
Foreground segmentationCUB-2011
IoU62.9
6
Landmark DetectionCUB (all)
Normalized L2 Distance12.1
6
Landmark DetectionCUB aligned
Normalized L2 Distance3.23
5
Showing 10 of 13 rows

Other info

Code

Follow for update