Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Label-Efficient Semantic Segmentation with Diffusion Models

About

Denoising diffusion probabilistic models have recently received much research attention since they outperform alternative approaches, such as GANs, and currently provide state-of-the-art generative performance. The superior performance of diffusion models has made them an appealing tool in several applications, including inpainting, super-resolution, and semantic editing. In this paper, we demonstrate that diffusion models can also serve as an instrument for semantic segmentation, especially in the setup when labeled data is scarce. In particular, for several pretrained diffusion models, we investigate the intermediate activations from the networks that perform the Markov step of the reverse diffusion process. We show that these activations effectively capture the semantic information from an input image and appear to be excellent pixel-level representations for the segmentation problem. Based on these observations, we describe a simple segmentation method, which can work even if only a few training images are provided. Our approach significantly outperforms the existing alternatives on several datasets for the same amount of human supervision.

Dmitry Baranchuk, Ivan Rubachev, Andrey Voynov, Valentin Khrulkov, Artem Babenko• 2021

Related benchmarks

TaskDatasetResultRank
Semantic segmentationADE20K
mIoU40.26
1024
Image ClassificationFood-101
Accuracy73
542
Image ClassificationCIFAR-10
Accuracy84
507
Image ClassificationOxford-IIIT Pets
Accuracy75.9
306
Image ClassificationFGVC Aircraft--
203
Image ClassificationFlowers-102
Top-1 Acc70
198
Image ClassificationSTL-10
Accuracy87.2
129
Semantic segmentationGLAS
Dice0.9045
59
Semantic CorrespondenceSPair-71k
Φ_bbox @ α=0.166.73
29
Cell SegmentationMoNuSeg
AJI (Object)67.31
28
Showing 10 of 19 rows

Other info

Follow for update