
CameraCtrl: Enabling Camera Control for Text-to-Video Generation

About

Controllability plays a crucial role in video generation, as it allows users to create and edit content more precisely. Existing models, however, lack control of camera pose that serves as a cinematic language to express deeper narrative nuances. To alleviate this issue, we introduce CameraCtrl, enabling accurate camera pose control for video diffusion models. Our approach explores effective camera trajectory parameterization along with a plug-and-play camera pose control module that is trained on top of a video diffusion model, leaving other modules of the base model untouched. Moreover, a comprehensive study on the effect of various training datasets is conducted, suggesting that videos with diverse camera distributions and similar appearance to the base model indeed enhance controllability and generalization. Experimental results demonstrate the effectiveness of CameraCtrl in achieving precise camera control with different video generation models, marking a step forward in the pursuit of dynamic and customized video storytelling from textual and camera pose inputs.

Hao He, Yinghao Xu, Yuwei Guo, Gordon Wetzstein, Bo Dai, Hongsheng Li, Ceyuan Yang • 2024

Related benchmarks

Task | Dataset | Metric | Result | Rank
--- | --- | --- | --- | ---
Novel View Synthesis | DL3DV (test) | PSNR | 12.2995 | 54
3D Scene Generation | WorldScore | Camera Control | 65.72 | 33
Novel View Synthesis | RealEstate10K Hard | PSNR | 11.7599 | 20
Novel View Synthesis | RealEstate10K Easy | PSNR | 13.6082 | 20
Novel View Synthesis | RealEstate10K Medium | PSNR | 11.9639 | 14
Single-view Novel View Synthesis | DL3DV Short-term (50th frame) | PSNR | 13.62 | 13
Single-view Novel View Synthesis | DL3DV (Long-term (200th frame)) | PSNR | 10.24 | 13
Single-view Novel View Synthesis | RealEstate10K Long-term, 200th frame 84 (test) | PSNR | 11.16 | 13
Single-view Novel View Synthesis | RealEstate10K Short-term, 50th frame 84 (test) | PSNR | 14.97 | 13
Camera Controllability | RealEstate10K (test) | mRotErr | 1.44 | 10
Showing 10 of 49 rows
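
Most rows above report PSNR. The page does not say how each benchmark computes it, so the following is only a minimal sketch of the standard definition, PSNR = 10 * log10(MAX^2 / MSE), for readers who want to interpret the numbers. The function name `psnr`, the flat-list input, and the default pixel range of [0, 1] are assumptions made for illustration; individual benchmarks may resize, crop, or average frames differently.

```python
import math


def psnr(reference, generated, max_value=1.0):
    """Peak signal-to-noise ratio in dB between two equally sized pixel sequences.

    Hypothetical helper: `reference` and `generated` are flat sequences of
    pixel intensities, and `max_value` is the assumed peak intensity
    (1.0 for normalized images, 255 for 8-bit images).
    """
    if len(reference) != len(generated) or len(reference) == 0:
        raise ValueError("inputs must be non-empty and have the same length")

    # Mean squared error over all pixels.
    mse = sum((r - g) ** 2 for r, g in zip(reference, generated)) / len(reference)

    # Identical images have zero error, so PSNR is unbounded.
    if mse == 0.0:
        return math.inf

    return 10.0 * math.log10(max_value ** 2 / mse)


if __name__ == "__main__":
    ground_truth = [0.0, 0.5, 1.0, 0.25]
    prediction = [0.1, 0.4, 0.9, 0.35]
    print(f"PSNR: {psnr(ground_truth, prediction):.2f} dB")
```

Higher PSNR means the generated view is closer to the ground-truth frame, so under this definition larger values in the table are better.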
