Conditional Random Fields as Recurrent Neural Networks
About
Pixel-level labelling tasks, such as semantic segmentation, play a central role in image understanding. Recent approaches have attempted to harness the capabilities of deep learning techniques for image recognition to tackle pixel-level labelling tasks. One central issue in this methodology is the limited capacity of deep learning techniques to delineate visual objects. To solve this problem, we introduce a new form of convolutional neural network that combines the strengths of Convolutional Neural Networks (CNNs) and Conditional Random Fields (CRFs)-based probabilistic graphical modelling. To this end, we formulate mean-field approximate inference for the Conditional Random Fields with Gaussian pairwise potentials as Recurrent Neural Networks. This network, called CRF-RNN, is then plugged in as a part of a CNN to obtain a deep network that has desirable properties of both CNNs and CRFs. Importantly, our system fully integrates CRF modelling with CNNs, making it possible to train the whole deep network end-to-end with the usual back-propagation algorithm, avoiding offline post-processing methods for object delineation. We apply the proposed method to the problem of semantic image segmentation, obtaining top results on the challenging Pascal VOC 2012 segmentation benchmark.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Semantic segmentation | PASCAL VOC 2012 (test) | mIoU74.7 | 1415 | |
| Semantic segmentation | Cityscapes (test) | mIoU62.5 | 1154 | |
| Semantic segmentation | PASCAL Context (val) | mIoU39.3 | 360 | |
| Semantic segmentation | Pascal VOC (test) | mIoU74.7 | 236 | |
| Semantic segmentation | Pascal Context | mIoU39.3 | 217 | |
| Semantic segmentation | Pascal Context (test) | mIoU39.3 | 191 | |
| Medical Image Segmentation | ACDC (test) | Avg DSC67.2 | 171 | |
| Weakly supervised semantic segmentation | PASCAL VOC 2012 (val) | mIoU52.8 | 168 | |
| Weakly supervised semantic segmentation | PASCAL VOC 2012 (test) | mIoU53.7 | 158 | |
| Semantic segmentation | Pascal Context 60 | mIoU39.3 | 139 |