Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Panacea: Panoramic and Controllable Video Generation for Autonomous Driving

About

The field of autonomous driving increasingly demands high-quality annotated training data. In this paper, we propose Panacea, an innovative approach to generate panoramic and controllable videos in driving scenarios, capable of yielding an unlimited numbers of diverse, annotated samples pivotal for autonomous driving advancements. Panacea addresses two critical challenges: 'Consistency' and 'Controllability.' Consistency ensures temporal and cross-view coherence, while Controllability ensures the alignment of generated content with corresponding annotations. Our approach integrates a novel 4D attention and a two-stage generation pipeline to maintain coherence, supplemented by the ControlNet framework for meticulous control by the Bird's-Eye-View (BEV) layouts. Extensive qualitative and quantitative evaluations of Panacea on the nuScenes dataset prove its effectiveness in generating high-quality multi-view driving-scene videos. This work notably propels the field of autonomous driving by effectively augmenting the training dataset used for advanced BEV perception techniques.

Yuqing Wen, Yucheng Zhao, Yingfei Liu, Fan Jia, Yanhui Wang, Chong Luo, Chi Zhang, Tiancai Wang, Xiaoyan Sun, Xiangyu Zhang• 2023

Related benchmarks

TaskDatasetResultRank
Camera GenerationnuScenes v1.0-trainval (val)
FID16.96
11
Driving Scene GenerationnuScenes (val)
FID16.96
9
Controllable Image GenerationnuScenes (val)
FID17
7
Multi-view video generationnuScenes (val)
FID17
7
Showing 4 of 4 rows

Other info

Code

Follow for update