PIRenderer: Controllable Portrait Image Generation via Semantic Neural Rendering
About
Generating portrait images by controlling the motions of existing faces is an important task of great consequence to social media industries. For easy use and intuitive control, semantically meaningful and fully disentangled parameters should be used as modifications. However, many existing techniques either do not provide such fine-grained control or rely on indirect editing methods, i.e., mimicking the motions of other individuals. In this paper, a Portrait Image Neural Renderer (PIRenderer) is proposed to control face motions with the parameters of three-dimensional morphable face models (3DMMs). The proposed model can generate photo-realistic portrait images with accurate movements according to intuitive modifications. Experiments on both direct and indirect editing tasks demonstrate the superiority of this model. We further extend the model to tackle the audio-driven facial reenactment task by extracting sequential motions from audio inputs. We show that our model can generate coherent videos with convincing movements from only a single reference image and a driving audio stream. Our source code is available at https://github.com/RenYurui/PIRender.
Related benchmarks
| Task | Dataset | Metric | Value | Rank |
|---|---|---|---|---|
| Talking head synthesis | User Study | -- | -- | 18 |
| Same-identity reconstruction | VoxCeleb 1 (test) | L1 Loss | 0.0566 | 7 |
| Expression Editing | VoxCeleb and HDTF Cross-Identity Reenactment (test) | CSIM | 0.9133 | 6 |
| Talking Face Generation | One-shot talking face generation Cross-Identity Reenactment | CSIM | 0.8215 | 5 |
| Cross-identity reenactment | Multi-view dataset (test) | CSIM | 0.632 | 5 |
| Multi-View Reenactment | Multi-view dataset (test) | PSNR | 20.04 | 5 |
| Talking Face Generation | One-shot talking face generation Same-Identity Reenactment | CSIM | 88.29 | 5 |
| Expression Editing | VoxCeleb and HDTF Same-Identity Reenactment (test) | CSIM | 0.9075 | 3 |
| Pose Editing | VoxCeleb and HDTF Same-Identity Reenactment (test) | CSIM | 0.9055 | 3 |
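
The benchmarks above report L1 loss, PSNR, and CSIM, but this page does not say how each benchmark computed them. As a rough guide to reading the numbers, the following is a minimal sketch of the usual definitions, not the evaluation code of any listed benchmark. It assumes images are arrays scaled to [0, 1] and that CSIM is the cosine similarity between identity embeddings produced by some pretrained face-recognition network; the embedding model is not specified here, and the function names are illustrative.

```python
import numpy as np


def l1_loss(pred, target):
    """Mean absolute pixel error between two images of the same shape."""
    pred = np.asarray(pred, dtype=np.float64)
    target = np.asarray(target, dtype=np.float64)
    return float(np.mean(np.abs(pred - target)))


def psnr(pred, target, max_value=1.0):
    """Peak signal-to-noise ratio in dB; max_value is the assumed pixel range."""
    pred = np.asarray(pred, dtype=np.float64)
    target = np.asarray(target, dtype=np.float64)
    mse = np.mean((pred - target) ** 2)
    if mse == 0:
        return float("inf")  # identical images
    return float(10.0 * np.log10(max_value ** 2 / mse))


def csim(embedding_a, embedding_b):
    """Cosine similarity between two identity embedding vectors.

    The embeddings are assumed to come from a face-recognition model
    applied to the generated image and to the reference image.
    """
    a = np.asarray(embedding_a, dtype=np.float64).ravel()
    b = np.asarray(embedding_b, dtype=np.float64).ravel()
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))
```

Under these definitions, lower is better for L1 loss and higher is better for PSNR and CSIM. Cosine similarity lies in [-1, 1], so a CSIM value such as 88.29 is presumably reported on a different scale (for example, as a percentage) than the other CSIM rows.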