GAN-Supervised Dense Visual Alignment

About

We propose GAN-Supervised Learning, a framework for learning discriminative models and their GAN-generated training data jointly end-to-end. We apply our framework to the dense visual alignment problem. Inspired by the classic Congealing method, our GANgealing algorithm trains a Spatial Transformer to map random samples from a GAN trained on unaligned data to a common, jointly-learned target mode. We show results on eight datasets, all of which demonstrate our method successfully aligns complex data and discovers dense correspondences. GANgealing significantly outperforms past self-supervised correspondence algorithms and performs on-par with (and sometimes exceeds) state-of-the-art supervised correspondence algorithms on several datasets -- without making use of any correspondence supervision or data augmentation and despite being trained exclusively on GAN-generated data. For precise correspondence, we improve upon state-of-the-art supervised methods by as much as $3\times$. We show applications of our method for augmented reality, image editing and automated pre-processing of image datasets for downstream GAN training.

William Peebles, Jun-Yan Zhu, Richard Zhang, Antonio Torralba, Alexei A. Efros, Eli Shechtman• 2021

Related benchmarks

Task	Dataset	Result
Semantic Correspondence	SPair-71k (test)	PCK@0.135.2	146
Keypoint Transfer	SPair-71k (test)	Bicycle37.5	38
Semantic Correspondence	CUB	PCK@0.156.8	14
Dense Correspondence	CUB (val)	PCK@0.157.5	10
Visual Concept Averaging	Animal, Person, Object, and Abstract categories (test)	Consistency (CLIP)0.00e+0	5

Showing 5 of 5 rows

Other info

Code

Follow for update

@wizwand_team Discord