Object Segmentation Without Labels with Large-Scale Generative Models
About
The recent rise of unsupervised and self-supervised learning has dramatically reduced the dependency on labeled data, providing effective image representations for transfer to downstream vision tasks. Furthermore, recent works employed these representations in a fully unsupervised setup for image classification, reducing the need for human labels on the fine-tuning stage as well. This work demonstrates that large-scale unsupervised models can also perform a more challenging object segmentation task, requiring neither pixel-level nor image-level labeling. Namely, we show that recent unsupervised GANs allow to differentiate between foreground/background pixels, providing high-quality saliency masks. By extensive comparison on standard benchmarks, we outperform existing unsupervised alternatives for object segmentation, achieving new state-of-the-art.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Salient Object Detection | DUTS (test) | -- | 302 | |
| Salient Object Detection | ECSSD | -- | 202 | |
| Salient Object Detection | ECSSD 1,000 images (test) | -- | 48 | |
| Saliency Detection | DUT-OMRON 29 (test) | IoU46.4 | 38 | |
| RGB saliency detection | ECSSD | F-measure (F_beta)79 | 25 | |
| Saliency Detection | DUTS (test) | IoU51.1 | 22 | |
| Saliency Detection | ECSSD 31 (test) | mIoU0.684 | 20 | |
| Saliency Detection | DUTS 30 (test) | IoU51.1 | 20 | |
| Unsupervised Object Segmentation | CUB | Jaccard Index71 | 16 | |
| Saliency Detection | DUTS | J-measure51.1 | 13 |