Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Label-Efficient Semantic Segmentation with Diffusion Models

About

Denoising diffusion probabilistic models have recently received much research attention since they outperform alternative approaches, such as GANs, and currently provide state-of-the-art generative performance. The superior performance of diffusion models has made them an appealing tool in several applications, including inpainting, super-resolution, and semantic editing. In this paper, we demonstrate that diffusion models can also serve as an instrument for semantic segmentation, especially in the setup when labeled data is scarce. In particular, for several pretrained diffusion models, we investigate the intermediate activations from the networks that perform the Markov step of the reverse diffusion process. We show that these activations effectively capture the semantic information from an input image and appear to be excellent pixel-level representations for the segmentation problem. Based on these observations, we describe a simple segmentation method, which can work even if only a few training images are provided. Our approach significantly outperforms the existing alternatives on several datasets for the same amount of human supervision.

Dmitry Baranchuk, Ivan Rubachev, Andrey Voynov, Valentin Khrulkov, Artem Babenko• 2021

Related benchmarks

TaskDatasetResultRank
Semantic segmentationADE20K
mIoU40.26
936
Image ClassificationCIFAR-10
Accuracy84
507
Image ClassificationFood-101
Accuracy73
494
Image ClassificationOxford-IIIT Pets
Accuracy75.9
259
Image ClassificationFGVC Aircraft--
185
Image ClassificationFlowers-102
Top-1 Acc70
141
Image ClassificationSTL-10
Accuracy87.2
109
Semantic CorrespondenceSPair-71k
Φ_bbox @ α=0.166.73
29
Cell SegmentationMoNuSeg
AJI (Object)67.31
28
Semantic segmentationGLAS
Dice0.9045
28
Showing 10 of 19 rows

Other info

Follow for update