A Probabilistic U-Net for Segmentation of Ambiguous Images

About

Many real-world vision problems suffer from inherent ambiguities. In clinical applications for example, it might not be clear from a CT scan alone which particular region is cancer tissue. Therefore a group of graders typically produces a set of diverse but plausible segmentations. We consider the task of learning a distribution over segmentations given an input. To this end we propose a generative segmentation model based on a combination of a U-Net with a conditional variational autoencoder that is capable of efficiently producing an unlimited number of plausible hypotheses. We show on a lung abnormalities segmentation task and on a Cityscapes segmentation task that our model reproduces the possible segmentation variants as well as the frequencies with which they occur, doing so significantly better than published approaches. These models could have a high impact in real-world applications, such as being used as clinical decision-making algorithms accounting for multiple plausible semantic segmentation hypotheses to provide possible diagnoses and recommend further actions to resolve the present ambiguities.

Simon A. A. Kohl, Bernardino Romera-Paredes, Clemens Meyer, Jeffrey De Fauw, Joseph R. Ledsam, Klaus H. Maier-Hein, S. M. Ali Eslami, Danilo Jimenez Rezende, Olaf Ronneberger• 2018

Related benchmarks

Task	Dataset	Result
Semantic segmentation	Cityscapes	mIoU63.2	526
Crack Segmentation	CRACK500	--	31
Multi-rater Medical Image Segmentation	LIDC-IDRI (test)	GED0.2168	26
Medical Image Segmentation	LIDC-IDRI (test)	GED0.29	24
Diverse Image Segmentation	MMFire	HM IoU68.7	24
Medical Image Segmentation	LIDC-IDRI	GED0.2168	23
Diverse Image Segmentation	LIDC	HM IoU78.5	18
Diverse Image Segmentation	Cityscapes	Image Quality Score91.6	18
Multi-rater Medical Image Segmentation	NPC-170 in-house (test)	GED0.4465	15
Medical Image Segmentation	MMIS (test)	GED0.227	12

Showing 10 of 36 rows

Other info

Follow for update

@wizwand_team Discord