Unsupervised Foreground Extraction via Deep Region Competition
About
We present Deep Region Competition (DRC), an algorithm designed to extract foreground objects from images in a fully unsupervised manner. Foreground extraction can be viewed as a special case of generic image segmentation that focuses on identifying and disentangling objects from the background. In this work, we rethink the foreground extraction by reconciling energy-based prior with generative image modeling in the form of Mixture of Experts (MoE), where we further introduce the learned pixel re-assignment as the essential inductive bias to capture the regularities of background regions. With this modeling, the foreground-background partition can be naturally found through Expectation-Maximization (EM). We show that the proposed method effectively exploits the interaction between the mixture components during the partitioning process, which closely connects to region competition, a seminal approach for generic image segmentation. Experiments demonstrate that DRC exhibits more competitive performances on complex real-world data and challenging multi-object scenes compared with prior methods. Moreover, we show empirically that DRC can potentially generalize to novel foreground objects even from categories unseen during training.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Image Classification | miniImageNet standard (test) | Accuracy43.14 | 61 | |
| Image Classification | CelebA (test) | Accuracy59.67 | 37 | |
| Unsupervised Object Segmentation | CUB | Jaccard Index56.4 | 16 | |
| Foreground extraction | Birds (test) | IoU54.6 | 10 | |
| Foreground extraction | Dogs (train) | IoU71.7 | 10 | |
| Foreground extraction | Dogs (test) | IoU72.3 | 10 | |
| Foreground extraction | Cars (train) | IoU72.4 | 10 | |
| Foreground extraction | Cars (test) | IoU70.8 | 10 | |
| Foreground extraction | Caltech-UCSD Birds-200 2011 (train) | IoU56.4 | 7 | |
| Foreground extraction | CLEVR6 (train) | IoU84.7 | 5 |