Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Learning A Physical-aware Diffusion Model Based on Transformer for Underwater Image Enhancement

About

Underwater visuals undergo various complex degradations, inevitably influencing the efficiency of underwater vision tasks. Recently, diffusion models were employed to underwater image enhancement (UIE) tasks, and gained SOTA performance. However, these methods fail to consider the physical properties and underwater imaging mechanisms in the diffusion process, limiting information completion capacity of diffusion models. In this paper, we introduce a novel UIE framework, named PA-Diff, designed to exploiting the knowledge of physics to guide the diffusion process. PA-Diff consists of Physics Prior Generation (PPG) Branch, Implicit Neural Reconstruction (INR) Branch, and Physics-aware Diffusion Transformer (PDT) Branch. Our designed PPG branch aims to produce the prior knowledge of physics. With utilizing the physics prior knowledge to guide the diffusion process, PDT branch can obtain underwater-aware ability and model the complex distribution in real-world underwater scenes. INR Branch can learn robust feature representations from diverse underwater image via implicit neural representation, which reduces the difficulty of restoration for PDT branch. Extensive experiments prove that our method achieves best performance on UIE tasks.

Chen Zhao, Chenyu Dong, Weiling Cai, Yueyue Wang• 2024

Related benchmarks

TaskDatasetResultRank
Underwater Image EnhancementU45
UCIQE0.593
23
Underwater Image EnhancementUIEB
PSNR23.47
13
Underwater Image EnhancementEUVP
PSNR29.12
13
Underwater Image EnhancementChallenge
UCIQE0.586
13
Underwater Image EnhancementLSUI
Inference Time (s)0.23
5
Showing 5 of 5 rows

Other info

Follow for update