Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

IDE-3D: Interactive Disentangled Editing for High-Resolution 3D-aware Portrait Synthesis

About

Existing 3D-aware facial generation methods face a dilemma in quality versus editability: they either generate editable results in low resolution or high-quality ones with no editing flexibility. In this work, we propose a new approach that brings the best of both worlds together. Our system consists of three major components: (1) a 3D-semantics-aware generative model that produces view-consistent, disentangled face images and semantic masks; (2) a hybrid GAN inversion approach that initialize the latent codes from the semantic and texture encoder, and further optimized them for faithful reconstruction; and (3) a canonical editor that enables efficient manipulation of semantic masks in canonical view and product high-quality editing results. Our approach is competent for many applications, e.g. free-view face drawing, editing, and style control. Both quantitative and qualitative results show that our method reaches the state-of-the-art in terms of photorealism, faithfulness, and efficiency.

Jingxiang Sun, Xuan Wang, Yichun Shi, Lizhen Wang, Jue Wang, Yebin Liu• 2022

Related benchmarks

TaskDatasetResultRank
Identity PreservationFace Images OOD
Accuracy (eyeglasses)88.11
8
Identity PreservationOOD Face Videos
Eyeglasses Consistency87.67
8
ReconstructionOOD videos Images
LPIPS0.5044
8
ReconstructionOOD videos
LPIPS0.4999
8
Novel View SynthesisCelebA-HQ
ID Similarity67.1
7
3D-aware Portrait SynthesisFFHQ 512x512 (train test)
FID4.6
5
3D-aware Portrait SynthesisCelebAHQ-Mask 512x512 (test)
FID4.9
4
GAN InversionCelebA-HQ 1500 images (test)
PSNR26.45
4
Showing 8 of 8 rows

Other info

Follow for update