Deep Attentional Guided Image Filtering
About
Guided filter is a fundamental tool in computer vision and computer graphics which aims to transfer structure information from guidance image to target image. Most existing methods construct filter kernels from the guidance itself without considering the mutual dependency between the guidance and the target. However, since there typically exist significantly different edges in the two images, simply transferring all structural information of the guidance to the target would result in various artifacts. To cope with this problem, we propose an effective framework named deep attentional guided image filtering, the filtering process of which can fully integrate the complementary information contained in both images. Specifically, we propose an attentional kernel learning module to generate dual sets of filter kernels from the guidance and the target, respectively, and then adaptively combine them by modeling the pixel-wise dependency between the two images. Meanwhile, we propose a multi-scale guided image filtering module to progressively generate the filtering result with the constructed kernels in a coarse-to-fine manner. Correspondingly, a multi-scale fusion strategy is introduced to reuse the intermediate results in the coarse-to-fine process. Extensive experiments show that the proposed framework compares favorably with the state-of-the-art methods in a wide range of guided image filtering applications, such as guided super-resolution, cross-modality restoration, texture removal, and semantic segmentation.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Depth Super-Resolution | Lu Bicubic downsampling synthetic (test) | RMSE (x4)0.83 | 20 | |
| Depth Super-Resolution | NYU Bicubic downsampling synthetic v2 (test) | RMSE (x4)1.36 | 20 | |
| Depth Super-Resolution | Middlebury Bicubic downsampling synthetic (test) | RMSE (x4)1.15 | 20 | |
| Depth Super-Resolution | RGB-D-D Bicubic downsampling synthetic (test) | RMSE (4x)1.14 | 19 | |
| Depth Super-Resolution | Lu Nearest-neighbor downsampling synthetic (test) | RMSE (x4)1.96 | 17 | |
| Depth Super-Resolution | NYU Nearest-neighbor downsampling synthetic v2 (test) | RMSE (x4)2.35 | 17 | |
| Depth Super-Resolution | Middlebury Nearest-neighbor downsampling synthetic (test) | RMSE (x4)1.78 | 17 | |
| Depth Super-Resolution | RGB-D-D Nearest-neighbor downsampling synthetic (test) | RMSE (x4)1.78 | 15 | |
| Guided Depth Super-resolution | NYU V2 | RMSE (4x)1.36 | 12 | |
| Guided Depth Super-resolution | Lu | RMSE (x4)0.83 | 11 |