OSDFace: One-Step Diffusion Model for Face Restoration

About

Diffusion models have demonstrated impressive performance in face restoration. Yet, their multi-step inference process remains computationally intensive, limiting their applicability in real-world scenarios. Moreover, existing methods often struggle to generate face images that are harmonious, realistic, and consistent with the subject's identity. In this work, we propose OSDFace, a novel one-step diffusion model for face restoration. Specifically, we propose a visual representation embedder (VRE) to better capture prior information and understand the input face. In VRE, low-quality faces are processed by a visual tokenizer and subsequently embedded with a vector-quantized dictionary to generate visual prompts. Additionally, we incorporate a facial identity loss derived from face recognition to further ensure identity consistency. We further employ a generative adversarial network (GAN) as a guidance model to encourage distribution alignment between the restored face and the ground truth. Experimental results demonstrate that OSDFace surpasses current state-of-the-art (SOTA) methods in both visual quality and quantitative metrics, generating high-fidelity, natural face images with high identity consistency. The code and model will be released at https://github.com/jkwang28/OSDFace.

Jingkai Wang, Jue Gong, Lin Zhang, Zheng Chen, Xing Liu, Hong Gu, Yutong Liu, Yulun Zhang, Xiaokang Yang• 2024

Related benchmarks

Task	Dataset	Result
Face Restoration	CelebA (test)	LPIPS0.2561	32
Face Restoration	CelebA synthetic (test)	LPIPS0.336	26
Video Face Restoration	VFHQ (test)	PSNR24.56	25
Video Face Restoration	CelebV-HQ (test)	PSNR24.26	16
Suspect Face Generation	ID-FFHQ	AF Match Rate0.894	13
Suspect Face Generation	CelebA ID	AF Match Rate0.951	13
Face Restoration	Face Restoration 512x512 resolution	Latency (s)0.1	10
Face Video Restoration	161-frame 512x512 video	FPS1.988	10
Face Restoration	WebPhoto 43 (test)	MUSIQ73.93	7
Face Restoration	Wider 61 (test)	MUSIQ74.6	7

Showing 10 of 14 rows

Other info

Follow for update

@wizwand_team Discord