Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

TrajectoryCrafter: Redirecting Camera Trajectory for Monocular Videos via Diffusion Models

About

We present TrajectoryCrafter, a novel approach to redirect camera trajectories for monocular videos. By disentangling deterministic view transformations from stochastic content generation, our method achieves precise control over user-specified camera trajectories. We propose a novel dual-stream conditional video diffusion model that concurrently integrates point cloud renders and source videos as conditions, ensuring accurate view transformations and coherent 4D content generation. Instead of leveraging scarce multi-view videos, we curate a hybrid training dataset combining web-scale monocular videos with static multi-view datasets, by our innovative double-reprojection strategy, significantly fostering robust generalization across diverse scenes. Extensive evaluations on multi-view and large-scale monocular videos demonstrate the superior performance of our method.

Mark YU, Wenbo Hu, Jinbo Xing, Ying Shan• 2025

Related benchmarks

TaskDatasetResultRank
Video GenerationVBench--
126
Single-object 4D Motion GenerationUser Study Single-object 4D Motion Generation 1.0 (test)
Prompt Alignment5
36
4D Scene ReconstructioniPhone
Apple Scene Score13.88
21
View SynchronizationBasic Benchmark (test)
FVD665.9
20
Video GenerationRealEstate10K (Re10K) (test)
PSNR16.94
13
Video GenerationRealEstate10K and DL3DV partial-revisit (evaluation)
Total Quality Score76.34
11
I2V Camera ControlDL3DV (test)
RRE1.08
10
Video Trajectory EditingiPhone short clips
PSNR13
8
Camera controlUltraVideo (test)
DINO0.0376
7
Narrow Dynamic View SynthesisDyCheck iPhone 1.0 (test)
PSNR14.24
7
Showing 10 of 50 rows

Other info

Follow for update