CANVAS: Continuity-Aware Narratives via Visual Agentic Storyboarding

About

Long-form visual storytelling requires maintaining continuity across shots, including consistent characters, stable environments, and smooth scene transitions. While existing generative models can produce strong individual frames, they fail to preserve such continuity, leading to appearance changes, inconsistent backgrounds, and abrupt scene shifts. We introduce CANVAS (Continuity-Aware Narratives via Visual Agentic Storyboarding), a multi-agent framework that explicitly plans visual continuity in multi-shot narratives. CANVAS enforces coherence through character continuity, persistent background anchors, and location-aware scene planning for smooth transitions within the same setting We evaluate CANVAS on two storyboard generation benchmarks ST-BENCH and ViStoryBench and introduce a new challenging benchmark HardContinuityBench for long-range narrative consistency. CANVAS consistently outperforms the best-performing baseline, improving background continuity by 21.6%, character consistency by 9.6% and props consistency by 7.6%.

Ishani Mondal, Yiwen Song, Mihir Parmar, Palash Goyal, Jordan Boyd-Graber, Tomas Pfister, Yale Song• 2026

Related benchmarks

Task	Dataset	Result
Visual Storytelling	ViStoryBench Lite 2025	CSD (Cross)0.491	21
Video Generation	FilMaster evaluation suite	Script Faithfulness (SF)4	9
Story-image generation	ViStoryBench Lite	CSD (Cross)0.49	5
Storyboard Generation	ViStoryBench Lite	CSD (Cross)0.49	5
Storyboard Generation	ViStoryBench	Background Consistency (Consecutive)4.83	5
Storyboard Generation	ST-Bench	BG Consistency (Consecutive)4.94	5
Storyboard Generation	HardContinuity	Background Consistency (Consecutive)4.88	5
Visual Storytelling	ContinuityEval	Background Consistency (Consecutive)4.6	5

Showing 8 of 8 rows

Other info

Follow for update

@wizwand_team Discord