MarioNette: Self-Supervised Sprite Learning

About

Artists and video game designers often construct 2D animations using libraries of sprites -- textured patches of objects and characters. We propose a deep learning approach that decomposes sprite-based video animations into a disentangled representation of recurring graphic elements in a self-supervised manner. By jointly learning a dictionary of possibly transparent patches and training a network that places them onto a canvas, we deconstruct sprite-based content into a sparse, consistent, and explicit representation that can be easily used in downstream tasks, like editing or analysis. Our framework offers a promising approach for discovering recurring visual patterns in image collections without supervision.

Dmitriy Smirnov, Michael Gharbi, Matthew Fisher, Vitor Guizilini, Alexei A. Efros, Justin Solomon• 2021

Related benchmarks

Task	Dataset	Result
Unsupervised Object Segmentation	CLEVRTEX 1.0 (test)	FG-ARI38.34	20
Unsupervised Object Segmentation	CAMO 1.0 (test)	FG-ARI31.54	16
Unsupervised Object Segmentation	CLEVR 1.0 (test)	FG-ARI72.12	16
Unsupervised Object Segmentation	OOD 1.0 (test)	FG-ARI3.73e+3	16
Instance Segmentation	CLEVR	mIoU72.1	11

Showing 5 of 5 rows

Other info

Follow for update

@wizwand_team Discord