Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

E.T. the Exceptional Trajectories: Text-to-camera-trajectory generation with character awareness

About

Stories and emotions in movies emerge through the effect of well-thought-out directing decisions, in particular camera placement and movement over time. Crafting compelling camera trajectories remains a complex iterative process, even for skilful artists. To tackle this, in this paper, we propose a dataset called the Exceptional Trajectories (E.T.) with camera trajectories along with character information and textual captions encompassing descriptions of both camera and character. To our knowledge, this is the first dataset of its kind. To show the potential applications of the E.T. dataset, we propose a diffusion-based approach, named DIRECTOR, which generates complex camera trajectories from textual captions that describe the relation and synchronisation between the camera and characters. To ensure robust and accurate evaluations, we train on the E.T. dataset CLaTr, a Contrastive Language-Trajectory embedding for evaluation metrics. We posit that our proposed dataset and method significantly advance the democratization of cinematography, making it more accessible to common users.

Robin Courant, Nicolas Dufour, Xi Wang, Marc Christie, Vicky Kalogeiton• 2024

Related benchmarks

TaskDatasetResultRank
Text-to-TrajectoryDataDoP (test)
F1-Score31.9
6
Text-to-TrajectoryShotVerse-Bench (test)
F1-Score28.9
6
Camera Trajectory GenerationShotBench (test)
FCD26.28
4
Direct Rendering Visual Quality AssessmentShotBench Unity-rendered (test)
Misalignment Rate0.803
4
Video Generation Visual Quality AssessmentShotBench Controllable Video Generation (test)
Consistency90.8
4
Human Preference RankingUnity Previsualization
Average Rank3.27
4
Human Preference RankingVideo Gen Downstream
Average Rank3.21
4
Showing 7 of 7 rows

Other info

Follow for update