Video Object Segmentation with Adaptive Feature Bank and Uncertain-Region Refinement
About
We propose a new matching-based framework for semi-supervised video object segmentation (VOS). Recently, state-of-the-art VOS performance has been achieved by matching-based algorithms, in which feature banks are created to store features for region matching and classification. However, how to effectively organize information in the continuously growing feature bank remains under-explored, and this leads to inefficient design of the bank. We introduce an adaptive feature bank update scheme to dynamically absorb new features and discard obsolete features. We also design a new confidence loss and a fine-grained segmentation module to enhance the segmentation accuracy in uncertain regions. On public benchmarks, our algorithm outperforms existing state-of-the-arts.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Video Object Segmentation | DAVIS 2017 (val) | J mean74.4 | 1130 | |
| Video Object Segmentation | YouTube-VOS 2018 (val) | J Score (Seen)78.8 | 493 | |
| Video Object Segmentation | YouTube-VOS 2018 | Score G79.6 | 47 | |
| Video Object Segmentation | DAVIS 2017 | Jaccard Index (J)73 | 42 | |
| Video Object Segmentation | Long-time Video dataset (val) | J&F Score83.7 | 21 | |
| Video Object Segmentation | Long-time Video dataset | J M82.9 | 13 | |
| Video Object Segmentation | Long-Videos | J_m0.827 | 11 | |
| Breast Lesion Segmentation | Breast Lesion Ultrasound Video dataset (test) | Dice75 | 10 | |
| Referring Video Object Segmentation | LVOS (val) | J&F Score34.8 | 10 | |
| Video Object Segmentation | Long-Videos (test) | Jaccard Mean (J_M)0.827 | 8 |