Conditional Random Fields as Recurrent Neural Networks
About
Pixel-level labelling tasks, such as semantic segmentation, play a central role in image understanding. Recent approaches have attempted to harness the capabilities of deep learning techniques for image recognition to tackle pixel-level labelling tasks. One central issue in this methodology is the limited capacity of deep learning techniques to delineate visual objects. To solve this problem, we introduce a new form of convolutional neural network that combines the strengths of Convolutional Neural Networks (CNNs) and Conditional Random Fields (CRFs)-based probabilistic graphical modelling. To this end, we formulate mean-field approximate inference for the Conditional Random Fields with Gaussian pairwise potentials as Recurrent Neural Networks. This network, called CRF-RNN, is then plugged in as a part of a CNN to obtain a deep network that has desirable properties of both CNNs and CRFs. Importantly, our system fully integrates CRF modelling with CNNs, making it possible to train the whole deep network end-to-end with the usual back-propagation algorithm, avoiding offline post-processing methods for object delineation. We apply the proposed method to the problem of semantic image segmentation, obtaining top results on the challenging Pascal VOC 2012 segmentation benchmark.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Semantic segmentation | PASCAL VOC 2012 (test) | mIoU74.7 | 1342 | |
| Semantic segmentation | Cityscapes (test) | mIoU62.5 | 1145 | |
| Semantic segmentation | PASCAL Context (val) | mIoU39.3 | 323 | |
| Semantic segmentation | Pascal VOC (test) | mIoU74.7 | 236 | |
| Semantic segmentation | Pascal Context (test) | mIoU39.3 | 176 | |
| Weakly supervised semantic segmentation | PASCAL VOC 2012 (test) | mIoU53.7 | 158 | |
| Weakly supervised semantic segmentation | PASCAL VOC 2012 (val) | mIoU52.8 | 154 | |
| Medical Image Segmentation | ACDC (test) | Avg DSC67.2 | 135 | |
| Semantic segmentation | PASCAL-Context 59 class (val) | mIoU39.3 | 125 | |
| Semantic segmentation | Pascal Context | mIoU39.3 | 111 |