Vision Transformers for Single Image Dehazing

About

Image dehazing is a representative low-level vision task that estimates latent haze-free images from hazy images. In recent years, convolutional neural network-based methods have dominated image dehazing. However, vision Transformers, which has recently made a breakthrough in high-level vision tasks, has not brought new dimensions to image dehazing. We start with the popular Swin Transformer and find that several of its key designs are unsuitable for image dehazing. To this end, we propose DehazeFormer, which consists of various improvements, such as the modified normalization layer, activation function, and spatial information aggregation scheme. We train multiple variants of DehazeFormer on various datasets to demonstrate its effectiveness. Specifically, on the most frequently used SOTS indoor set, our small model outperforms FFA-Net with only 25% #Param and 5% computational cost. To the best of our knowledge, our large model is the first method with the PSNR over 40 dB on the SOTS indoor set, dramatically outperforming the previous state-of-the-art methods. We also collect a large-scale realistic remote sensing dehazing dataset for evaluating the method's capability to remove highly non-homogeneous haze.

Yuda Song, Zhuqing He, Hui Qian, Xin Du• 2022

Related benchmarks

Task	Dataset	Result
Image Denoising	BSD68	PSNR30.89	419
Image Deblurring	GoPro	PSNR25.93	414
Deraining	Rain100L	PSNR33.68	280
Dehazing	SOTS	PSNR31.78	238
Image Dehazing	SOTS	PSNR31.78	171
Low-light Image Enhancement	LOL	PSNR21.31	162
Image Dehazing	SOTS (test)	PSNR31.78	161
Image Dehazing	SOTS Outdoor	PSNR34.29	124
Image Dehazing	SOTS Indoor	PSNR40.05	83
Image Dehazing	SOTS Outdoor (test)	PSNR34.95	82

Showing 10 of 80 rows

...

Other info

Code

Follow for update

@wizwand_team Discord