GMMSeg: Gaussian Mixture based Generative Semantic Segmentation Models
About
Prevalent semantic segmentation solutions are, in essence, a dense discriminative classifier of p(class|pixel feature). Though straightforward, this de facto paradigm neglects the underlying data distribution p(pixel feature|class), and struggles to identify out-of-distribution data. Going beyond this, we propose GMMSeg, a new family of segmentation models that rely on a dense generative classifier for the joint distribution p(pixel feature,class). For each class, GMMSeg builds Gaussian Mixture Models (GMMs) via Expectation-Maximization (EM), so as to capture class-conditional densities. Meanwhile, the deep dense representation is end-to-end trained in a discriminative manner, i.e., maximizing p(class|pixel feature). This endows GMMSeg with the strengths of both generative and discriminative models. With a variety of segmentation architectures and backbones, GMMSeg outperforms the discriminative counterparts on three closed-set datasets. More impressively, without any modification, GMMSeg even performs well on open-world datasets. We believe this work brings fundamental insights into the related fields.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Semantic segmentation | ADE20K (val) | mIoU56.7 | 2731 | |
| Semantic segmentation | Cityscapes (test) | mIoU81.1 | 1145 | |
| Semantic segmentation | Cityscapes (val) | mIoU83.8 | 332 | |
| Semantic segmentation | Coco-Stuff (test) | mIoU52 | 184 | |
| Anomaly Segmentation | Fishyscapes Lost & Found (test) | FPR@956.6 | 61 | |
| Anomaly Segmentation | Fishyscapes Lost & Found (val) | FPR9512.55 | 53 | |
| Anomaly Segmentation | Road Anomaly (test) | FPR9544.34 | 47 | |
| Anomaly Segmentation | Fishyscapes Static (test) | FPR9515.96 | 28 | |
| Out-of-Distribution Detection | Fishyscapes Lost & Found (test) | AP55.63 | 11 | |
| Out-of-Distribution Detection | Fishyscapes Static (test) | AP76.02 | 11 |