The Devil is in the Details: StyleFeatureEditor for Detail-Rich StyleGAN Inversion and High Quality Image Editing

About

The task of manipulating real image attributes through StyleGAN inversion has been extensively researched. This process involves searching latent variables from a well-trained StyleGAN generator that can synthesize a real image, modifying these latent variables, and then synthesizing an image with the desired edits. A balance must be struck between the quality of the reconstruction and the ability to edit. Earlier studies utilized the low-dimensional W-space for latent search, which facilitated effective editing but struggled with reconstructing intricate details. More recent research has turned to the high-dimensional feature space F, which successfully inverses the input image but loses much of the detail during editing. In this paper, we introduce StyleFeatureEditor -- a novel method that enables editing in both w-latents and F-latents. This technique not only allows for the reconstruction of finer image details but also ensures their preservation during editing. We also present a new training pipeline specifically designed to train our model to accurately edit F-latents. Our method is compared with state-of-the-art encoding approaches, demonstrating that our model excels in terms of reconstruction quality and is capable of editing even challenging out-of-domain examples. Code is available at https://github.com/AIRI-Institute/StyleFeatureEditor.

Denis Bobkov, Vadim Titov, Aibek Alanov, Dmitry Vetrov• 2024

Related benchmarks

Task	Dataset	Result
Image Reconstruction	FFHQ No glasses	LPIPS0.023	18
Image Reconstruction	FFHQ Glasses	LPIPS0.024	18
Image Editing (Add glasses)	FFHQ (test)	ID-Sim0.71	15
Image Editing (Remove glasses)	FFHQ (test)	ID-Sim0.789	15
Attribute Classification	FFHQ (test)	Accuracy97.3	15
Facial Expression Synthesis	CFEE	ECS0.7	3

Showing 6 of 6 rows

Other info

Follow for update

@wizwand_team Discord