Light-A-Video: Training-free Video Relighting via Progressive Light Fusion
About
Recent advancements in image relighting models, driven by large-scale datasets and pre-trained diffusion models, have enabled the imposition of consistent lighting. However, video relighting still lags, primarily due to the excessive training costs and the scarcity of diverse, high-quality video relighting datasets. A simple application of image relighting models on a frame-by-frame basis leads to several issues: lighting source inconsistency and relighted appearance inconsistency, resulting in flickers in the generated videos. In this work, we propose Light-A-Video, a training-free approach to achieve temporally smooth video relighting. Adapted from image relighting models, Light-A-Video introduces two key techniques to enhance lighting consistency. First, we design a Consistent Light Attention (CLA) module, which enhances cross-frame interactions within the self-attention layers of the image relight model to stabilize the generation of the background lighting source. Second, leveraging the physical principle of light transport independence, we apply linear blending between the source video's appearance and the relighted appearance, using a Progressive Light Fusion (PLF) strategy to ensure smooth temporal transitions in illumination. Experiments show that Light-A-Video improves the temporal consistency of relighted video while maintaining the relighted image quality, ensuring coherent lighting transitions across frames. Project page: https://bujiazi.github.io/light-a-video.github.io/.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Video Relighting | VBench | Dynamic Degree0.78 | 6 | |
| Video Relighting | 60 videos (81 frames each) | Scaled Average Relighting Time276 | 5 | |
| Video Relighting | Dataset of 50 videos | AQ0.6157 | 5 | |
| Video Relighting | Video Relighting Dataset 100 clips (test) | SSIM0.604 | 5 | |
| Video Relighting | Static lighting scenes (test) | PSNR_light (dB)16.63 | 5 | |
| Video Harmonization | Curated Portrait Video Dataset | PSNR15.64 | 5 | |
| Foreground Video Relighting | Background image-conditioned foreground video relighting dataset (test) | Aesthetic Score0.619 | 5 | |
| Video Relighting | In-the-wild data | Motion Preservation0.4557 | 4 | |
| Video Relighting | Video Relighting Dataset (test) | Aesthetic Score0.614 | 4 | |
| Video Relighting | Real in-the-wild videos | PSNR12.66 | 3 |