Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

ReCamMaster: Camera-Controlled Generative Rendering from A Single Video

About

Camera control has been actively studied in text or image conditioned video generation tasks. However, altering camera trajectories of a given video remains under-explored, despite its importance in the field of video creation. It is non-trivial due to the extra constraints of maintaining multiple-frame appearance and dynamic synchronization. To address this, we present ReCamMaster, a camera-controlled generative video re-rendering framework that reproduces the dynamic scene of an input video at novel camera trajectories. The core innovation lies in harnessing the generative capabilities of pre-trained text-to-video models through a simple yet powerful video conditioning mechanism--its capability is often overlooked in current research. To overcome the scarcity of qualified training data, we construct a comprehensive multi-camera synchronized video dataset using Unreal Engine 5, which is carefully curated to follow real-world filming characteristics, covering diverse scenes and camera movements. It helps the model generalize to in-the-wild videos. Lastly, we further improve the robustness to diverse inputs through a meticulously designed training strategy. Extensive experiments show that our method substantially outperforms existing state-of-the-art approaches. Our method also finds promising applications in video stabilization, super-resolution, and outpainting. Our code and dataset are publicly available at: https://github.com/KwaiVGI/ReCamMaster.

Jianhong Bai, Menghan Xia, Xiao Fu, Xintao Wang, Lianrui Mu, Jinwen Cao, Zuozhu Liu, Haoji Hu, Xiang Bai, Pengfei Wan, Di Zhang• 2025

Related benchmarks

TaskDatasetResultRank
Video GenerationVBench--
102
View SynchronizationBasic Benchmark (test)
FVD675.4
20
Camera ControllabilityRealEstate10K (test)
mRotErr1.935
10
Multi-shot Video Generation90 prompts evaluation suite
Type Accuracy3.33
9
Stereo Image ConversionMarvel-10K
PSNR30.44
8
Stereo Video ConversionMarvel-10K
PSNR30.41
8
Camera controlUltraVideo (test)
DINO0.0504
7
Stereo Video SynthesisStereo4D Parallel Format
MS-SSIM52.5
7
Video Novel View SynthesisSynthetic multi-camera video dataset (test)
Refinement Error (Iter 1 vs 2)0.1181
6
Cinematic Video GenerationScene-Decoupled Video Dataset (test)
CLIP-T30.86
6
Showing 10 of 35 rows

Other info

Follow for update