Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

MST: Adaptive Multi-Scale Tokens Guided Interactive Segmentation

About

Interactive segmentation has gained significant attention for its application in human-computer interaction and data annotation. To address the target scale variation issue in interactive segmentation, a novel multi-scale token adaptation algorithm is proposed. By performing top-k operations across multi-scale tokens, the computational complexity is greatly simplified while ensuring performance. To enhance the robustness of multi-scale token selection, we also propose a token learning algorithm based on contrastive loss. This algorithm can effectively improve the performance of multi-scale token adaptation. Extensive benchmarking shows that the algorithm achieves state-of-the-art (SOTA) performance, compared to current methods. An interactive demo and all reproducible codes will be released at https://github.com/hahamyt/mst.

Long Xu, Shanghong Li, Yongquan Chen, Jun Luo, Shiwu Lai• 2024

Related benchmarks

TaskDatasetResultRank
Interactive SegmentationBerkeley
NoC@901.5
230
Interactive SegmentationGrabCut
NoC@901.48
225
Interactive SegmentationDAVIS
NoC@904.55
197
Interactive SegmentationSBD
NoC @ 90% Target5.11
171
Interactive SegmentationPascal VOC
NoC@851.69
43
Interactive Mask CorrectionDAVIS From Initial Mask 585
NoC 851.8
22
Interactive Instance SegmentationCOCO (MVal)
NoC @ 85%2.08
13
Interactive SegmentationLoveDA (test)
NoC (80%)5.3
12
Interactive SegmentationGeneral Efficiency Benchmarking
Parameters (MB)166.8
6
Showing 9 of 9 rows

Other info

Code

Follow for update