SnowFormer: Context Interaction Transformer with Scale-awareness for Single Image Desnowing
About
Due to various and complicated snow degradations, single image desnowing is a challenging image restoration task. As prior arts can not handle it ideally, we propose a novel transformer, SnowFormer, which explores efficient cross-attentions to build local-global context interaction across patches and surpasses existing works that employ local operators or vanilla transformers. Compared to prior desnowing methods and universal image restoration methods, SnowFormer has several benefits. Firstly, unlike the multi-head self-attention in recent image restoration Vision Transformers, SnowFormer incorporates the multi-head cross-attention mechanism to perform local-global context interaction between scale-aware snow queries and local-patch embeddings. Second, the snow queries in SnowFormer are generated by the query generator from aggregated scale-aware features, which are rich in potential clean cues, leading to superior restoration results. Third, SnowFormer outshines advanced state-of-the-art desnowing networks and the prevalent universal image restoration transformers on six synthetic and real-world datasets. The code is released in \url{https://github.com/Ephemeral182/SnowFormer}.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Single Image Desnowing | CSD 2000 (test) | PSNR41.35 | 26 | |
| Image Desnowing | CSD (test) | PSNR39.45 | 16 | |
| Single Image Desnowing | SRRS 2000 (test) | PSNR34.99 | 15 | |
| Single Image Desnowing | Snow 100K 2000 (test) | PSNR37.03 | 15 | |
| Single Image Desnowing | SnowKITTI 2000 (test) | PSNR38.96 | 11 | |
| Image Desnowing | Snow100K (real-world) | IL-NIQE22.185 | 9 | |
| Snow Removal | LLFF Fern synthetic snowflakes (test) | PSNR28.34 | 4 | |
| Snow Removal | DTU House synthetic snowflakes (test) | PSNR27.93 | 4 |