Panacea+: Panoramic and Controllable Video Generation for Autonomous Driving
About
The field of autonomous driving increasingly demands high-quality annotated video training data. In this paper, we propose Panacea+, a powerful and universally applicable framework for generating video data in driving scenes. Built upon the foundation of our previous work, Panacea, Panacea+ adopts a multi-view appearance noise prior mechanism and a super-resolution module for enhanced consistency and increased resolution. Extensive experiments show that the generated video samples from Panacea+ greatly benefit a wide range of tasks on different datasets, including 3D object tracking, 3D object detection, and lane detection tasks on the nuScenes and Argoverse 2 dataset. These results strongly prove Panacea+ to be a valuable data generation framework for autonomous driving.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| 3D Object Detection | nuScenes (val) | NDS49.2 | 941 | |
| Video Generation | nuScenes (val) | FVD139 | 37 | |
| Multi-view Driving Video Generation | NuScenes v1.0 (test) | FVD139 | 11 | |
| Camera Generation | nuScenes (val) | FID16.96 | 10 | |
| 3D Object Detection | nuScenes (T+I)2V scenarios (val) | NDS32.1 | 8 |