Teachers Do More Than Teach: Compressing Image-to-Image Models

About

Generative Adversarial Networks (GANs) have achieved huge success in generating high-fidelity images, however, they suffer from low efficiency due to tremendous computational cost and bulky memory usage. Recent efforts on compression GANs show noticeable progress in obtaining smaller generators by sacrificing image quality or involving a time-consuming searching process. In this work, we aim to address these issues by introducing a teacher network that provides a search space in which efficient network architectures can be found, in addition to performing knowledge distillation. First, we revisit the search space of generative models, introducing an inception-based residual block into generators. Second, to achieve target computation cost, we propose a one-step pruning algorithm that searches a student architecture from the teacher model and substantially reduces searching cost. It requires no l1 sparsity regularization and its associated hyper-parameters, simplifying the training procedure. Finally, we propose to distill knowledge through maximizing feature similarity between teacher and student via an index named Global Kernel Alignment (GKA). Our compressed networks achieve similar or even better image fidelity (FID, mIoU) than the original models with much-reduced computational cost, e.g., MACs. Code will be released at https://github.com/snap-research/CAT.

Qing Jin, Jian Ren, Oliver J. Woodford, Jiazhuo Wang, Geng Yuan, Yanzhi Wang, Sergey Tulyakov• 2021

Related benchmarks

Task	Dataset	Result
Image-to-Image Translation	Edges2Shoes (test)	FID43.97	24
Image-to-Image Translation	Horse -> Zebra	FID60.18	23
Image-to-Image Translation	Cityscapes	mIoU62.35	20
Unpaired Image-to-Image Translation	CycleGAN Horse -> Zebra (test)	FID67.31	19
Unpaired Image-to-Image Translation	Zebra -> Horse CycleGAN (test)	FID141	19
Image-to-Image Translation	horse2zebra	FID60.18	10
Image-to-Image	Horse -> Zebra	FID53.48	7
Paired image-to-image translation	Cityscapes (val)	Delta Change-0.06	7
Architecture Search	Cityscapes	Search Cost (GPU Seconds)4.28	3
Image-to-Image	Map → Aerial photo	FID45.63	3

Showing 10 of 13 rows

Other info

Code

Follow for update

@wizwand_team Discord