Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

LDP: Language-driven Dual-Pixel Image Defocus Deblurring Network

About

Recovering sharp images from dual-pixel (DP) pairs with disparity-dependent blur is a challenging task.~Existing blur map-based deblurring methods have demonstrated promising results. In this paper, we propose, to the best of our knowledge, the first framework that introduces the contrastive language-image pre-training framework (CLIP) to accurately estimate the blur map from a DP pair unsupervisedly. To achieve this, we first carefully design text prompts to enable CLIP to understand blur-related geometric prior knowledge from the DP pair. Then, we propose a format to input a stereo DP pair to CLIP without any fine-tuning, despite the fact that CLIP is pre-trained on monocular images. Given the estimated blur map, we introduce a blur-prior attention block, a blur-weighting loss, and a blur-aware loss to recover the all-in-focus image. Our method achieves state-of-the-art performance in extensive experiments (see Fig.~\ref{fig:teaser}).

Hao Yang, Liyuan Pan, Yan Yang, Richard Hartley, Miaomiao Liu• 2023

Related benchmarks

TaskDatasetResultRank
Defocus DeblurringDDD-syn
PSNR38.62
12
Defocus DeblurringRDPD (test)
PSNR33.44
12
Defocus DeblurringDPD-blur Outdoor Scenes 1 (test)
PSNR29.88
12
Defocus DeblurringDPD-blur Indoor Scenes 1 (test)
PSNR24.73
12
Defocus DeblurringDPD-blur Average 1 (test)
PSNR27.24
12
Showing 5 of 5 rows

Other info

Follow for update