MarioNette: Self-Supervised Sprite Learning
About
Artists and video game designers often construct 2D animations using libraries of sprites -- textured patches of objects and characters. We propose a deep learning approach that decomposes sprite-based video animations into a disentangled representation of recurring graphic elements in a self-supervised manner. By jointly learning a dictionary of possibly transparent patches and training a network that places them onto a canvas, we deconstruct sprite-based content into a sparse, consistent, and explicit representation that can be easily used in downstream tasks, like editing or analysis. Our framework offers a promising approach for discovering recurring visual patterns in image collections without supervision.
Dmitriy Smirnov, Michael Gharbi, Matthew Fisher, Vitor Guizilini, Alexei A. Efros, Justin Solomon• 2021
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Unsupervised Object Segmentation | CLEVRTEX 1.0 (test) | FG-ARI38.34 | 20 | |
| Unsupervised Object Segmentation | CAMO 1.0 (test) | FG-ARI31.54 | 16 | |
| Unsupervised Object Segmentation | CLEVR 1.0 (test) | FG-ARI72.12 | 16 | |
| Unsupervised Object Segmentation | OOD 1.0 (test) | FG-ARI3.73e+3 | 16 |
Showing 4 of 4 rows