Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Guess What Moves: Unsupervised Video and Image Segmentation by Anticipating Motion

About

Motion, measured via optical flow, provides a powerful cue to discover and learn objects in images and videos. However, compared to using appearance, it has some blind spots, such as the fact that objects become invisible if they do not move. In this work, we propose an approach that combines the strengths of motion-based and appearance-based segmentation. We propose to supervise an image segmentation network with the pretext task of predicting regions that are likely to contain simple motion patterns, and thus likely to correspond to objects. As the model only uses a single image as input, we can apply it in two settings: unsupervised video segmentation, and unsupervised image segmentation. We achieve state-of-the-art results for videos, and demonstrate the viability of our approach on still images containing novel objects. Additionally we experiment with different motion models and optical flow backbones and find the method to be robust to these change. Project page and code available at https://www.robots.ox.ac.uk/~vgg/research/gwm.

Subhabrata Choudhury, Laurynas Karazija, Iro Laina, Andrea Vedaldi, Christian Rupprecht• 2022

Related benchmarks

TaskDatasetResultRank
Salient Object DetectionECSSD--
202
Unsupervised Video Object SegmentationDAVIS 2016 (val)--
108
Video Object SegmentationDAVIS
J Mean66.9
58
Unsupervised Video Object SegmentationSegTrack v2
Jaccard Score78.9
56
Unsupervised Video Object SegmentationDAVIS 2016 (test)
J Mean79.5
50
Unsupervised Video Object SegmentationFBMS59
Jaccard Score78.4
43
Single Object Video SegmentationSegTrack v2 (val)
J Mean78.9
27
RGB saliency detectionECSSD
F-measure (F_beta)74.3
25
Unsupervised Video Object SegmentationDAVIS 2016
Jaccard Score80.7
24
Unsupervised Video Object SegmentationFBMS-59 (test)
J Score78.4
17
Showing 10 of 21 rows

Other info

Code

Follow for update