FlowIID: Single-Step Intrinsic Image Decomposition via Latent Flow Matching
About
Intrinsic Image Decomposition (IID) separates an image into albedo and shading components. It is a core step in many real-world applications, such as relighting and material editing. Existing IID models achieve good results, but often use a large number of parameters. This makes them costly to combine with other models in real-world settings. To address this problem, we propose a flow matching-based solution. For this, we design a novel architecture, FlowIID, based on latent flow matching. FlowIID combines a VAE-guided latent space with a flow matching module, enabling a stable decomposition of albedo and shading. FlowIID is not only parameter-efficient, but also produces results in a single inference step. Despite its compact design, FlowIID delivers competitive and superior results compared to existing models across various benchmarks. This makes it well-suited for deployment in resource-constrained and real-time vision applications.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Albedo Estimation | ARAP | LMSE0.021 | 19 | |
| Shading Estimation | ARAP (test) | LMSE0.022 | 12 | |
| Albedo Estimation | ARAP (test) | LMSE0.021 | 11 | |
| Albedo Estimation | MIT Intrinsic Barron and Malik (test) | LMSE0.0043 | 1 | |
| Shading Estimation | MIT Intrinsic Barron and Malik (test) | LMSE0.0119 | 1 |