RestoreFormer: High-Quality Blind Face Restoration from Undegraded Key-Value Pairs
About
Blind face restoration is to recover a high-quality face image from unknown degradations. As face image contains abundant contextual information, we propose a method, RestoreFormer, which explores fully-spatial attentions to model contextual information and surpasses existing works that use local operators. RestoreFormer has several benefits compared to prior arts. First, unlike the conventional multi-head self-attention in previous Vision Transformers (ViTs), RestoreFormer incorporates a multi-head cross-attention layer to learn fully-spatial interactions between corrupted queries and high-quality key-value pairs. Second, the key-value pairs in ResotreFormer are sampled from a reconstruction-oriented high-quality dictionary, whose elements are rich in high-quality facial features specifically aimed for face reconstruction, leading to superior restoration results. Third, RestoreFormer outperforms advanced state-of-the-art methods on one synthetic dataset and three real-world datasets, as well as produces images with better visual quality.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Blind Face Restoration | LFW (test) | FID47.75 | 52 | |
| Blind Face Restoration | CelebA (test) | SSIM64.6 | 44 | |
| Blind Face Restoration | WebPhoto (test) | FID77.33 | 35 | |
| Deblurring | Deblurring (test) | PSNR32.92 | 22 | |
| Blind Face Restoration | WIDER (test) | FID49.817 | 17 | |
| Under-display camera image restoration | TOLED 75 (test) | PSNR20.98 | 16 | |
| Under-display camera image restoration | POLED 75 (test) | PSNR9.04 | 16 | |
| Image Dehazing | Dehazing (test) | PSNR30.87 | 15 | |
| Image Deraining | 5sets (test) | PSNR33.96 | 14 | |
| Blind Face Restoration | CelebAdult (test) | FID104 | 12 |