Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Automated Movie Generation via Multi-Agent CoT Planning

About

Existing long-form video generation frameworks lack automated planning, requiring manual input for storylines, scenes, cinematography, and character interactions, resulting in high costs and inefficiencies. To address these challenges, we present MovieAgent, an automated movie generation via multi-agent Chain of Thought (CoT) planning. MovieAgent offers two key advantages: 1) We firstly explore and define the paradigm of automated movie/long-video generation. Given a script and character bank, our MovieAgent can generates multi-scene, multi-shot long-form videos with a coherent narrative, while ensuring character consistency, synchronized subtitles, and stable audio throughout the film. 2) MovieAgent introduces a hierarchical CoT-based reasoning process to automatically structure scenes, camera settings, and cinematography, significantly reducing human effort. By employing multiple LLM agents to simulate the roles of a director, screenwriter, storyboard artist, and location manager, MovieAgent streamlines the production pipeline. Experiments demonstrate that MovieAgent achieves new state-of-the-art results in script faithfulness, character consistency, and narrative coherence. Our hierarchical framework takes a step forward and provides new insights into fully automated movie generation. The code and project website are available at: https://github.com/showlab/MovieAgent and https://weijiawu.github.io/MovieAgent.

Weijia Wu, Zeyu Zhu, Mike Zheng Shou• 2025

Related benchmarks

TaskDatasetResultRank
Long Video GenerationVBench
Overall Score97.6
35
Long Video GenerationViStory Self-Consistency
ViStory-Self Score0.913
15
Long Video GenerationMovieBench
MovieBench Score27.962
15
Long Video GenerationMSVE-Bench
MSVE-Bench (NB-Q)73.8
15
Long Video GenerationStoryMem
StoryMem Score98
15
Long Video GenerationViStory (Cross-Frame)
ViStory-Cross28.6
15
Visual Storytelling ConsistencyViStoryBench
CSD (Self)0.479
13
Generative Video StorytellingGenAd-Bench
VAF61.2
11
Sketch Comedy Video GenerationMedian Sketch Comedies (test)
Win Rate13
10
Multi-shot Cinematic Video GenerationMulti-shot Cinematic Video Generation (test)
AQ (Aesthetic Quality)57.42
9
Showing 10 of 28 rows

Other info

Follow for update