The One Hundred Layers Tiramisu: Fully Convolutional DenseNets for Semantic Segmentation
About
State-of-the-art approaches for semantic image segmentation are built on Convolutional Neural Networks (CNNs). The typical segmentation architecture is composed of (a) a downsampling path responsible for extracting coarse semantic features, followed by (b) an upsampling path trained to recover the input image resolution at the output of the model and, optionally, (c) a post-processing module (e.g. Conditional Random Fields) to refine the model predictions. Recently, a new CNN architecture, Densely Connected Convolutional Networks (DenseNets), has shown excellent results on image classification tasks. The idea of DenseNets is based on the observation that if each layer is directly connected to every other layer in a feed-forward fashion then the network will be more accurate and easier to train. In this paper, we extend DenseNets to deal with the problem of semantic segmentation. We achieve state-of-the-art results on urban scene benchmark datasets such as CamVid and Gatech, without any further post-processing module nor pretraining. Moreover, due to smart construction of the model, our approach has much less parameters than currently published best entries for these datasets. Code to reproduce the experiments is available here : https://github.com/SimJeg/FC-DenseNet/blob/master/train.py
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Semantic segmentation | CamVid (test) | mIoU66.9 | 411 | |
| Brain Tissue Segmentation | iSEG challenge 2017 (test) | CSF DSC91.26 | 20 | |
| Semantic segmentation | MFNet 1.0 (nighttime) | mAcc53.1 | 15 | |
| Semantic segmentation | MFNet 1.0 (daytime) | mAcc52.9 | 15 | |
| Semantic segmentation | CamVid 11 classes (test) | mIoU66.9 | 13 | |
| Semantic segmentation | WSOC (test) | mIoU (mean)0.38 | 10 | |
| Robot navigation | Simulated environment (test) | Interventions32.7 | 8 | |
| Semantic segmentation | GATECH (test) | -- | 8 | |
| Lane Marking Detection | TuSimple lane marking challenge as of March 14, 2018 (test) | Accuracy94.54 | 7 | |
| Semantic segmentation | Gatech | Accuracy79.4 | 4 |