Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

ShotDirector: Directorially Controllable Multi-Shot Video Generation with Cinematographic Transitions

About

Shot transitions play a pivotal role in multi-shot video generation, as they determine the overall narrative expression and the directorial design of visual storytelling. However, recent progress has primarily focused on low-level visual consistency across shots, neglecting how transitions are designed and how cinematographic language contributes to coherent narrative expression. This often leads to mere sequential shot changes without intentional film-editing patterns. To address this limitation, we propose ShotDirector, an efficient framework that integrates parameter-level camera control and hierarchical editing-pattern-aware prompting. Specifically, we adopt a camera control module that incorporates 6-DoF poses and intrinsic settings to enable precise camera information injection. In addition, a shot-aware mask mechanism is employed to introduce hierarchical prompts aware of professional editing patterns, allowing fine-grained control over shot content. Through this design, our framework effectively combines parameter-level conditions with high-level semantic guidance, achieving film-like controllable shot transitions. To facilitate training and evaluation, we construct ShotWeaver40K, a dataset that captures the priors of film-like editing patterns, and develop a set of evaluation metrics for controllable multi-shot video generation. Extensive experiments demonstrate the effectiveness of our framework.

Xiaoxue Wu, Xinyuan Chen, Yaohui Wang, Yu Qiao• 2025

Related benchmarks

TaskDatasetResultRank
Multi-shot Video Generation90 prompts evaluation suite
Type Accuracy67.44
9
Showing 1 of 1 rows

Other info

Follow for update