
Putting the Object Back into Video Object Segmentation

About

We present Cutie, a video object segmentation (VOS) network with object-level memory reading, which puts the object representation from memory back into the video object segmentation result. Recent VOS methods employ bottom-up pixel-level memory reading, which struggles with matching noise, especially in the presence of distractors, and therefore performs worse on more challenging data. In contrast, Cutie performs top-down object-level memory reading by adapting a small set of object queries. These queries interact iteratively with the bottom-up pixel features through a query-based object transformer (qt, hence Cutie). The object queries act as a high-level summary of the target object, while high-resolution feature maps are retained for accurate segmentation. Together with foreground-background masked attention, Cutie cleanly separates the semantics of the foreground object from the background. On the challenging MOSE dataset, Cutie improves by 8.7 J&F over XMem with a similar running time and improves by 4.2 J&F over DeAOT while being three times faster. Code is available at: https://hkchengrex.github.io/Cutie

Ho Kei Cheng, Seoung Wug Oh, Brian Price, Joon-Young Lee, Alexander Schwing • 2023
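
The foreground-background masked attention mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `masked_attention`, the even split of queries into a foreground half and a background half, and the fallback for an empty region are all assumptions made for illustration, and plain Python lists stand in for the tensors a real model would use.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def masked_attention(queries, pixel_feats, fg_mask):
    """Read pixel features into object queries under a foreground/background mask.

    queries:     list of query vectors (each a list of floats)
    pixel_feats: list of per-pixel feature vectors, same dimension as the queries
    fg_mask:     list of bools, True where the pixel belongs to the target object

    Assumption (hypothetical split): the first half of the queries may attend
    only to foreground pixels, the second half only to background pixels.
    """
    n_fg_queries = len(queries) // 2
    dim = len(pixel_feats[0])
    outputs = []
    for qi, q in enumerate(queries):
        want_fg = qi < n_fg_queries
        allowed = [i for i, is_fg in enumerate(fg_mask) if is_fg == want_fg]
        if not allowed:
            # Assumed fallback: an empty region would make softmax undefined,
            # so attend to every pixel instead.
            allowed = list(range(len(pixel_feats)))
        scale = math.sqrt(len(q))
        scores = [
            sum(a * b for a, b in zip(q, pixel_feats[i])) / scale for i in allowed
        ]
        weights = softmax(scores)
        # Weighted sum of the allowed pixel features.
        read = [
            sum(w * pixel_feats[i][d] for w, i in zip(weights, allowed))
            for d in range(dim)
        ]
        outputs.append(read)
    return outputs


# Two queries, two pixels: pixel 0 is foreground, pixel 1 is background.
example = masked_attention(
    queries=[[1.0, 0.0], [1.0, 0.0]],
    pixel_feats=[[1.0, 0.0], [0.0, 1.0]],
    fg_mask=[True, False],
)
```

In this toy case the first query can see only the foreground pixel and the second only the background pixel, so each query summarises one region without mixing in features from the other.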

Related benchmarks

Task                                      | Dataset                | Result               | Rank
Video Object Segmentation                 | DAVIS 2017 (val)       | J mean 85.6          | 1226
Video Object Segmentation                 | YouTube-VOS 2019 (val) | J-Score (Seen) 86.8  | 240
Video Object Segmentation                 | SA-V (val)             | J&F Score 61.3       | 114
Video Object Segmentation                 | SA-V (test)            | J&F 62.8             | 110
Video Object Segmentation                 | LVOS v2 (val)          | J&F 92.2             | 63
Video Object Segmentation                 | MOSE (val)             | J&F Score 71.7       | 54
Semi-supervised Video Object Segmentation | DAVIS 2017 (val)       | J&F Score 88.1       | 42
Video Object Segmentation                 | MOSE                   | J&F Score 68.3       | 29
Semi-supervised Video Object Segmentation | SA-V (test)            | J&F Score 62.8       | 26
Semi-supervised Video Object Segmentation | SA-V (val)             | J&F Score 61.3       | 26
Showing 10 of 50 rows
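
For readers unfamiliar with the metrics in the table, the following is a minimal sketch of how a J&F score is commonly formed, assuming the DAVIS-style convention: J is the Jaccard index (intersection over union) between predicted and ground-truth masks, F is a boundary F-measure, and J&F is their mean. The helper names `jaccard` and `j_and_f` are hypothetical, F is taken as a given number rather than computed, and scoring two empty masks as 1.0 is an assumption.

```python
def jaccard(pred, gt):
    """Region similarity J: intersection over union of two binary masks.

    pred, gt: 2D lists of 0/1 values (or bools) with the same shape.
    Assumed convention: two empty masks count as a perfect match (1.0).
    """
    inter = 0
    union = 0
    for pred_row, gt_row in zip(pred, gt):
        for p, g in zip(pred_row, gt_row):
            if p and g:
                inter += 1
            if p or g:
                union += 1
    return 1.0 if union == 0 else inter / union


def j_and_f(j, f):
    """Combine region similarity J and boundary accuracy F by averaging."""
    return (j + f) / 2


pred_mask = [[1, 1],
             [0, 0]]
gt_mask = [[1, 0],
           [0, 0]]

j = jaccard(pred_mask, gt_mask)   # 1 shared pixel out of 2 in the union
score = j_and_f(j, 0.7)           # 0.7 is a made-up F value for illustration
```

Benchmark tables usually report these values multiplied by 100 and averaged over all objects and frames.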
