Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Learning A Physical-aware Diffusion Model Based on Transformer for Underwater Image Enhancement

About

Underwater visuals undergo various complex degradations, inevitably influencing the efficiency of underwater vision tasks. Recently, diffusion models were employed to underwater image enhancement (UIE) tasks, and gained SOTA performance. However, these methods fail to consider the physical properties and underwater imaging mechanisms in the diffusion process, limiting information completion capacity of diffusion models. In this paper, we introduce a novel UIE framework, named PA-Diff, designed to exploiting the knowledge of physics to guide the diffusion process. PA-Diff consists of Physics Prior Generation (PPG) Branch, Implicit Neural Reconstruction (INR) Branch, and Physics-aware Diffusion Transformer (PDT) Branch. Our designed PPG branch aims to produce the prior knowledge of physics. With utilizing the physics prior knowledge to guide the diffusion process, PDT branch can obtain underwater-aware ability and model the complex distribution in real-world underwater scenes. INR Branch can learn robust feature representations from diverse underwater image via implicit neural representation, which reduces the difficulty of restoration for PDT branch. Extensive experiments prove that our method achieves best performance on UIE tasks.

Chen Zhao, Chenyu Dong, Weiling Cai, Yueyue Wang• 2024

Related benchmarks

TaskDatasetResultRank
Underwater Image EnhancementU45
UCIQE0.593
33
Underwater Image EnhancementChallenge
UCIQE0.586
23
Underwater Image EnhancementEUVP
UCIQE59.9
21
Underwater Image EnhancementUIEB
PSNR23.47
13
Underwater Image EnhancementLSUI
Inference Time (s)0.23
5
Showing 5 of 5 rows

Other info

Follow for update