Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation

About

Character Animation aims to generating character videos from still images through driving signals. Currently, diffusion models have become the mainstream in visual generation research, owing to their robust generative capabilities. However, challenges persist in the realm of image-to-video, especially in character animation, where temporally maintaining consistency with detailed information from character remains a formidable problem. In this paper, we leverage the power of diffusion models and propose a novel framework tailored for character animation. To preserve consistency of intricate appearance features from reference image, we design ReferenceNet to merge detail features via spatial attention. To ensure controllability and continuity, we introduce an efficient pose guider to direct character's movements and employ an effective temporal modeling approach to ensure smooth inter-frame transitions between video frames. By expanding the training data, our approach can animate arbitrary characters, yielding superior results in character animation compared to other image-to-video methods. Furthermore, we evaluate our method on benchmarks for fashion video and human dance synthesis, achieving state-of-the-art results.

Li Hu, Xin Gao, Peng Zhang, Ke Sun, Bang Zhang, Liefeng Bo• 2023

Related benchmarks

Task	Dataset	Result
Human Image Animation	RealisDance (val)	Subject Consistency94.65	27
Fashion video synthesis	UBC fashion video dataset (test)	SSIM0.931	18
Human Dance Generation	Tiktok (test)	SSIM0.718	17
Character Image Animation	Follow-Your-Pose V2	LPIPS0.183	15
Human Image Animation	TikTok	FVD171.9	15
Human Image Animation	Tiktok (test)	FVD935.6	15
Audio-driven half-body human video generation	EMTD 1.0 (evaluation set)	FID58.98	14
Video Generation	Tiktok (test)	SSIM0.86	11
Sign Language Video Generation	RWTH-PHOENIX-Weather 2014T (test)	SSIM79.4	10
Motion-Controlled Video Generation	RealisDance (val)	Average Score83.78	10

Showing 10 of 67 rows

Other info

Code

Follow for update

@wizwand_team Discord