Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

MotionClone: Training-Free Motion Cloning for Controllable Video Generation

About

Motion-based controllable video generation offers the potential for creating captivating visual content. Existing methods typically necessitate model training to encode particular motion cues or incorporate fine-tuning to inject certain motion patterns, resulting in limited flexibility and generalization. In this work, we propose MotionClone, a training-free framework that enables motion cloning from reference videos to versatile motion-controlled video generation, including text-to-video and image-to-video. Based on the observation that the dominant components in temporal-attention maps drive motion synthesis, while the rest mainly capture noisy or very subtle motions, MotionClone utilizes sparse temporal attention weights as motion representations for motion guidance, facilitating diverse motion transfer across varying scenarios. Meanwhile, MotionClone allows for the direct extraction of motion representation through a single denoising step, bypassing the cumbersome inversion processes and thus promoting both efficiency and flexibility. Extensive experiments demonstrate that MotionClone exhibits proficiency in both global camera motion and local object motion, with notable superiority in terms of motion fidelity, textual alignment, and temporal consistency.

Pengyang Ling, Jiazi Bu, Pan Zhang, Xiaoyi Dong, Yuhang Zang, Tong Wu, Huaian Chen, Jiaqi Wang, Yi Jin• 2024

Related benchmarks

TaskDatasetResultRank
Video GenerationVBench--
102
Motion TransferDAVIS Caption
MF Score0.635
12
Motion TransferDAVIS Subject
MF64
12
Motion TransferDAVIS All
MF0.634
12
Motion TransferDAVIS Scene
MF Score0.628
12
Motion CustomizationTGVE 76 videos (full)
Text Alignment27.23
12
Motion TransferDAVIS Easy
CLIP Score0.2996
9
Motion TransferDAVIS Medium
CLIP Score0.3014
9
Motion TransferDAVIS Hard
CLIP Score0.2974
9
Motion TransferDAVIS (All subsets)
CLIP Score0.2995
9
Showing 10 of 16 rows

Other info

Follow for update