LiWi: Layering in the Wild
About
Recent advances in generative models have empowered impressive layered image generation, yet their success is largely confined to graphic design domains. The layering of in-the-wild images remains an underexplored problem, limiting fine-grained editing and applications of images in real-world scenarios. Specifically, challenges remain in scalable layered data and the modeling of object interaction in natural images, such as illumination effects and structural boundary. To address these bottlenecks, we propose a novel framework for high-fidelity natural image decomposition. First, we introduce an Agent-driven Data Decomposition (ADD) pipeline that orchestrates agents and tools to synthesize layered data without manual intervention. Utilizing this pipeline, we construct a large-scale dataset, named LiWi-100k, with over 100,000 high-quality layered in-the-wild images. Second, we present a novel framework that jointly improves photometric fidelity and alpha boundary accuracy. Specifically, shadow-guided learning explicitly models the illumination effects, and degradation-restoration objective provides boundary-correction supervision by recovering clean foreground image from degraded one. Extensive experiments demonstrate that our framework achieves state-of-the-art (SoTA) performance in natural image decomposition, outperforming existing models in RGB L1 and Alpha IoU metrics. We will soon release our code and dataset.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Dichotomous Image Segmentation | DIS5K (DIS-VD) | F_beta (Weighted)0.744 | 44 | |
| Dichotomous Image Segmentation | DIS5K TE (1-4) (test) | Fw_beta74.4 | 42 | |
| Media design decomposition into RGBA layers | Crello (test) | RGB L1 Error0.0321 | 32 | |
| Dichotomous Image Segmentation | DIS5K TE1 (test) | M6.4 | 20 | |
| Dichotomous Image Segmentation | DIS5K TE2 (test) | Fw_beta0.777 | 20 | |
| Dichotomous Image Segmentation | DIS5K TE3 (test) | Fw_beta76 | 20 | |
| Dichotomous Image Segmentation | DIS5K TE4 (test) | Fw_beta68.5 | 20 | |
| Layer Decomposition | LiWi 100k (test) | RGB L1 Error0.081 | 9 |