First Order Motion Model for Image Animation

About

Image animation consists of generating a video sequence so that an object in a source image is animated according to the motion of a driving video. Our framework addresses this problem without using any annotation or prior information about the specific object to animate. Once trained on a set of videos depicting objects of the same category (e.g. faces, human bodies), our method can be applied to any object of this class. To achieve this, we decouple appearance and motion information using a self-supervised formulation. To support complex motions, we use a representation consisting of a set of learned keypoints along with their local affine transformations. A generator network models occlusions arising during target motions and combines the appearance extracted from the source image and the motion derived from the driving video. Our framework scores best on diverse benchmarks and on a variety of object categories. Our source code is publicly available.
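The keypoints-plus-local-affine representation mentioned above can be illustrated with a small sketch. It assumes one plausible reading of "first order": near each keypoint, the mapping from driving-frame coordinates to source-frame coordinates is approximated by a first-order Taylor expansion, that is, a translation between the two keypoint positions plus a 2x2 Jacobian. The function and parameter names (local_affine_warp, kp_source, jac_driving, and so on) are hypothetical and are not taken from the authors' code; in the actual method the keypoints and Jacobians would be predicted by a network, whereas here they are plain inputs.

```python
from typing import Tuple

Vec2 = Tuple[float, float]
Mat2 = Tuple[Tuple[float, float], Tuple[float, float]]


def mat_inv(m: Mat2) -> Mat2:
    """Invert a 2x2 matrix; raise if it is (numerically) singular."""
    (a, b), (c, d) = m
    det = a * d - b * c
    if abs(det) < 1e-12:
        raise ValueError("Jacobian is singular")
    return ((d / det, -b / det), (-c / det, a / det))


def mat_mul(m: Mat2, n: Mat2) -> Mat2:
    """Multiply two 2x2 matrices."""
    return tuple(
        tuple(sum(m[i][k] * n[k][j] for k in range(2)) for j in range(2))
        for i in range(2)
    )


def mat_vec(m: Mat2, v: Vec2) -> Vec2:
    """Apply a 2x2 matrix to a 2D vector."""
    return (m[0][0] * v[0] + m[0][1] * v[1],
            m[1][0] * v[0] + m[1][1] * v[1])


def local_affine_warp(z: Vec2,
                      kp_source: Vec2, jac_source: Mat2,
                      kp_driving: Vec2, jac_driving: Mat2) -> Vec2:
    """Map a driving-frame coordinate z to the source frame.

    Assumed first-order approximation around one keypoint:
        z_source ~= kp_source + J * (z - kp_driving),
        J = jac_source @ inverse(jac_driving)
    """
    j = mat_mul(jac_source, mat_inv(jac_driving))
    offset = (z[0] - kp_driving[0], z[1] - kp_driving[1])
    dx, dy = mat_vec(j, offset)
    return (kp_source[0] + dx, kp_source[1] + dy)


IDENTITY: Mat2 = ((1.0, 0.0), (0.0, 1.0))
```

With identity Jacobians the mapping reduces to a pure translation between the two keypoints, and a point located exactly at the driving keypoint always maps to the source keypoint. A non-identity Jacobian adds local rotation, scaling, or shear around the keypoint, which is what lets a small set of keypoints describe more complex motion than translation alone.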

Aliaksandr Siarohin, Stéphane Lathuilière, Sergey Tulyakov, Elisa Ricci, Nicu Sebe • 2020

Related benchmarks

Task                           Dataset                                Result           Rank
Talking head synthesis         User Study                             --               18
Human Dance Generation         Tiktok (test)                          SSIM 0.648       17
Face Reenactment               VoxCeleb1 (test)                       SSIM 0.723       16
Head Avatar Synthesis          Head Avatar Evaluation Dataset (test)  LPIPS 0.298      10
Video Reconstruction           Tai-Chi-HD                             L1 Loss 0.055    10
Video self-reconstruction      VoxCeleb1 (test)                       L1 Loss 0.0412   9
Cross-identity face animation  VoxCeleb 1                             ARD 3.122        9
Video Reconstruction           VoxCeleb                               L1 Loss 0.041    8
Video self-reconstruction      CelebV-HQ (test)                       L1 Error 0.0531  8
Same-identity reconstruction   VoxCeleb 1 (test)                      L1 Loss 0.045    7

(10 of 51 rows shown)

