Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

ATI: Any Trajectory Instruction for Controllable Video Generation

About

We propose a unified framework for motion control in video generation that seamlessly integrates camera movement, object-level translation, and fine-grained local motion using trajectory-based inputs. In contrast to prior methods that address these motion types through separate modules or task-specific designs, our approach offers a cohesive solution by projecting user-defined trajectories into the latent space of pre-trained image-to-video generation models via a lightweight motion injector. Users can specify keypoints and their motion paths to control localized deformations, entire object motion, virtual camera dynamics, or combinations of these. The injected trajectory signals guide the generative process to produce temporally consistent and semantically aligned motion sequences. Our framework demonstrates superior performance across multiple video motion control tasks, including stylized motion effects (e.g., motion brushes), dynamic viewpoint changes, and precise local motion manipulation. Experiments show that our method provides significantly better controllability and visual quality compared to prior approaches and commercial solutions, while remaining broadly compatible with various state-of-the-art video generation backbones. Project page: https://anytraj.github.io/.

Angtian Wang, Haibin Huang, Jacob Zhiyuan Fang, Yiding Yang, Chongyang Ma• 2025

Related benchmarks

TaskDatasetResultRank
Controllable Video GenerationUser Study
Trajectory Following Score19
4
Pose-guided video generationXPose Non-human (test)
PSNR30.15
4
Trajectory-guided video generation100 image-trajectory pairs
ObjMC127.2
4
Showing 3 of 3 rows

Other info

Follow for update