Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Rectifying Latent Space for Generative Single-Image Reflection Removal

About

Single-image reflection removal is a highly ill-posed problem, where existing methods struggle to reason about the composition of corrupted regions, causing them to fail at recovery and generalization in the wild. This work reframes an editing-purpose latent diffusion model to effectively perceive and process highly ambiguous, layered image inputs, yielding high-quality outputs. We argue that the challenge of this conversion stems from a critical yet overlooked issue, i.e., the latent space of semantic encoders lacks the inherent structure to interpret a composite image as a linear superposition of its constituent layers. Our approach is built on three synergistic components, including a reflection-equivariant VAE that aligns the latent space with the linear physics of reflection formation, a learnable task-specific text embedding for precise guidance that bypasses ambiguous language, and a depth-guided early-branching sampling strategy to harness generative stochasticity for promising results. Extensive experiments reveal that our model achieves new SOTA performance on multiple benchmarks and generalizes well to challenging real-world cases.

Mingjia Li, Jin Hu, Hainuo Wang, Qiming Hu, Jiarui Wang, Xiaojie Guo• 2025

Related benchmarks

TaskDatasetResultRank
Single Image Reflection RemovalReal20 (test)
PSNR27.58
70
Single Image Reflection RemovalSIR2 454 (test)
PSNR28.08
11
Single Image Reflection RemovalNature 20 (test)
PSNR27.34
11
Single Image Reflection RemovalOpenRR (val)
Avg Success Rate96.6
3
Single Image Reflection RemovalNature
Average Success Rate96
3
Single Image Reflection RemovalReal20
Average Success Rate91
3
Single Image Reflection RemovalSIR2
Avg Success Rate78.5
3
Single Image Reflection RemovalPublic Benchmarks (test)
Success Rate90.5
3
Showing 8 of 8 rows

Other info

Follow for update