Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

TrajectoryCrafter: Redirecting Camera Trajectory for Monocular Videos via Diffusion Models

About

We present TrajectoryCrafter, a novel approach to redirect camera trajectories for monocular videos. By disentangling deterministic view transformations from stochastic content generation, our method achieves precise control over user-specified camera trajectories. We propose a novel dual-stream conditional video diffusion model that concurrently integrates point cloud renders and source videos as conditions, ensuring accurate view transformations and coherent 4D content generation. Instead of leveraging scarce multi-view videos, we curate a hybrid training dataset combining web-scale monocular videos with static multi-view datasets, by our innovative double-reprojection strategy, significantly fostering robust generalization across diverse scenes. Extensive evaluations on multi-view and large-scale monocular videos demonstrate the superior performance of our method.

Mark YU, Wenbo Hu, Jinbo Xing, Ying Shan• 2025

Related benchmarks

TaskDatasetResultRank
Video GenerationVBench--
126
Single-object 4D Motion GenerationUser Study Single-object 4D Motion Generation 1.0 (test)
Prompt Alignment5
36
Novel View SynthesisiPhone dataset
SSIM0.492
33
4D Scene ReconstructioniPhone
Apple Scene Score13.88
21
View SynchronizationBasic Benchmark (test)
FVD665.9
20
Video GenerationRealEstate10K (Re10K) (test)
PSNR16.94
16
Video GenerationRealEstate10K and DL3DV partial-revisit (evaluation)
Total Quality Score76.34
11
Video GenerationDL3DV
PSNR21.42
10
I2V Camera ControlDL3DV (test)
RRE1.08
10
Dynamic ReconstructionDyCheck
PSNR14.34
8
Showing 10 of 79 rows
...

Other info

Follow for update