
InterFormer: Real-time Interactive Image Segmentation

About

Interactive image segmentation enables annotators to efficiently perform pixel-level annotation for segmentation tasks. However, the existing interactive segmentation pipeline suffers from inefficient computation in interactive models because of two issues. First, each click an annotator makes depends on the model's feedback on the previous click. This serial interaction cannot exploit the model's parallelism. Second, in each interaction step, the model processes the invariant image along with the sparse, variable clicks, resulting in a process that is highly repetitive and redundant. For efficient computation, we propose a method named InterFormer that follows a new pipeline to address these issues. InterFormer extracts the computationally time-consuming part, i.e., image processing, from the existing process and performs it in advance. Specifically, InterFormer employs a large vision transformer (ViT) on high-performance devices to preprocess images in parallel, and then uses a lightweight module called interactive multi-head self attention (I-MSA) for interactive segmentation. Furthermore, the I-MSA module's deployment on low-power devices extends the practical application of interactive segmentation. The I-MSA module utilizes the preprocessed features to respond efficiently to annotator inputs in real time. Experiments on several datasets demonstrate the effectiveness of InterFormer, which outperforms previous interactive segmentation models in terms of computational efficiency and segmentation quality, achieving real-time, high-quality interactive segmentation on CPU-only devices. The code is available at https://github.com/YouHuang67/InterFormer.

You Huang, Hao Yang, Ke Sun, Shengchuan Zhang, Liujuan Cao, Guannan Jiang, Rongrong Ji • 2023
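
The two-stage pipeline described in the abstract (an expensive image encoder run once per image ahead of time, then a cheap module run on every click) can be illustrated with a small sketch. This is not the InterFormer code: `TwoStageSegmenter`, `toy_encode`, and `toy_decode` are hypothetical names, the "encoder" only returns a grid size, and the "decoder" is a nearest-click rule standing in for I-MSA. The point is only the control flow, where features are cached and reused across clicks.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Tuple

# A click is (row, col, is_positive). Hypothetical representation for this sketch.
Click = Tuple[int, int, bool]


@dataclass
class TwoStageSegmenter:
    """Heavy encoder runs once per image; light decoder runs once per click."""
    encode: Callable[[str], object]                   # stand-in for the large ViT
    decode: Callable[[object, List[Click]], object]   # stand-in for I-MSA
    _cache: Dict[str, object] = field(default_factory=dict)
    encoder_calls: int = 0

    def preprocess(self, image_ids: List[str]) -> None:
        # Offline stage: compute and store features for every image up front.
        for image_id in image_ids:
            if image_id not in self._cache:
                self._cache[image_id] = self.encode(image_id)
                self.encoder_calls += 1

    def click(self, image_id: str, clicks: List[Click]):
        # Online stage: reuse cached features, never re-run the encoder.
        return self.decode(self._cache[image_id], clicks)


def toy_encode(image_id: str) -> Tuple[int, int]:
    # Toy "feature": just a fixed 4x4 grid size.
    return (4, 4)


def toy_decode(features: Tuple[int, int], clicks: List[Click]) -> List[List[bool]]:
    # Toy rule: a pixel is foreground if its nearest click is positive.
    height, width = features
    mask = []
    for r in range(height):
        row = []
        for c in range(width):
            nearest = min(clicks, key=lambda k: (k[0] - r) ** 2 + (k[1] - c) ** 2)
            row.append(nearest[2])
        mask.append(row)
    return mask


segmenter = TwoStageSegmenter(toy_encode, toy_decode)
segmenter.preprocess(["img_0", "img_1"])           # encoder runs twice, once per image

clicks: List[Click] = [(0, 0, True)]
mask_after_first = segmenter.click("img_0", clicks)
clicks.append((3, 3, False))                       # annotator adds a negative click
mask_after_second = segmenter.click("img_0", clicks)
```

After both clicks, `segmenter.encoder_calls` is still 2: the per-click cost depends only on the lightweight decoder, which is the property the abstract attributes to running I-MSA on low-power devices.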

Related benchmarks

Task                            Dataset          Metric             Result  Rank
Interactive Segmentation        Berkeley         NoC@90             2.53    230
Interactive Segmentation        GrabCut          NoC@90             1.36    225
Interactive Segmentation        DAVIS            NoC@90             5.21    197
Interactive Segmentation        SBD              NoC @ 90% Target   5.51    171
Interactive Image Segmentation  GrabCut          NoC@90             1.36    28
Interactive Image Segmentation  DAVIS            NoC @ 90% IoU      5.21    27
Interactive Image Segmentation  SBD              NoC90              5.51    16
Interactive Segmentation        MVTec            NoC@90             10.84   13
Interactive Segmentation        COD10K           NoC@90             9.42    13
Interactive Image Segmentation  HQSeg-44K (val)  5-mIoU             82.62   12
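
Most rows above report NoC@90, the number of clicks needed to reach 90% IoU, averaged over images (lower is better). A minimal sketch of how such a number is typically computed follows. The function names are hypothetical, and the cap of 20 clicks for images that never reach the target is an assumption based on common practice in interactive segmentation evaluation; this page does not state the cap used for these results.

```python
from typing import Sequence


def number_of_clicks(ious_per_click: Sequence[float],
                     target: float = 0.90,
                     max_clicks: int = 20) -> int:
    """Return the first click count at which IoU reaches `target`.

    `ious_per_click[i]` is the IoU after click i + 1. If the target is
    never reached within `max_clicks`, the cap itself is returned
    (assumed convention, see lead-in).
    """
    for n, iou in enumerate(ious_per_click[:max_clicks], start=1):
        if iou >= target:
            return n
    return max_clicks


def mean_noc(per_image_ious: Sequence[Sequence[float]],
             target: float = 0.90,
             max_clicks: int = 20) -> float:
    """Average the per-image click counts over a dataset."""
    counts = [number_of_clicks(ious, target, max_clicks) for ious in per_image_ious]
    return sum(counts) / len(counts)


# Two made-up images: one reaches 90% IoU on the first click, one on the second.
example_score = mean_noc([[0.95], [0.70, 0.92]])
```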