Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

A Fully Convolutional Two-Stream Fusion Network for Interactive Image Segmentation

About

In this paper, we propose a novel fully convolutional two-stream fusion network (FCTSFN) for interactive image segmentation. The proposed network includes two sub-networks: a two-stream late fusion network (TSLFN) that predicts the foreground at a reduced resolution, and a multi-scale refining network (MSRN) that refines the foreground at full resolution. The TSLFN includes two distinct deep streams followed by a fusion network. The intuition is that, since user interactions are more direct information on foreground/background than the image itself, the two-stream structure of the TSLFN reduces the number of layers between the pure user interaction features and the network output, allowing the user interactions to have a more direct impact on the segmentation result. The MSRN fuses the features from different layers of TSLFN with different scales, in order to seek the local to global information on the foreground to refine the segmentation result at full resolution. We conduct comprehensive experiments on four benchmark datasets. The results show that the proposed network achieves competitive performance compared to current state-of-the-art interactive image segmentation methods

Yang Hu, Andrea Soltoggio, Russell Lock, Steve Carter• 2018

Related benchmarks

TaskDatasetResultRank
Interactive SegmentationBerkeley
NoC@906.49
230
Interactive SegmentationGrabCut
NoC@903.76
225
Interactive Instance SegmentationGrabCut (test)
NoC @ 90%3.76
14
Interactive Instance SegmentationCOCO (MVal)
NoC @ 85%9.62
13
Interactive Instance SegmentationBerkeley (test)
NoC @ 90%6.49
11
Interactive SegmentationPASCAL VOC 12 (val)
Clicks @ 85% IoU4.58
7
Showing 6 of 6 rows

Other info

Follow for update