DRIFT: Deep Restoration, ISP Fusion, and Tone-mapping
About
Smartphone cameras have gained immense popularity with the adoption of high-resolution and high-dynamic range imaging. As a result, high-performance camera Image Signal Processors (ISPs) are crucial in generating high-quality images for the end user while keeping computational costs low. In this paper, we propose DRIFT (Deep Restoration, ISP Fusion, and Tone-mapping): an efficient AI mobile camera pipeline that generates high quality RGB images from hand-held raw captures. The first stage of DRIFT is a Multi-Frame Processing (MFP) network that is trained using a adversarial perceptual loss to perform multi-frame alignment, denoising, demosaicing, and super-resolution. Then, the output of DRIFT-MFP is processed by a novel deep-learning based tone-mapping (DRIFT-TM) solution that allows for tone tunability, ensures tone-consistency with a reference pipeline, and can be run efficiently for high-resolution images on a mobile device. We show qualitative and quantitative comparisons against state-of-the-art MFP and tone-mapping methods to demonstrate the effectiveness of our approach.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Multi-frame 4x Super-Resolution | 150 12MP images (test) | LPIPS0.1 | 9 | |
| Multi-frame Denoising | Custom Handheld 12MP Raw Captures 1.0 (test) | LPIPS0.05 | 8 | |
| Tone Mapping | held-out (test) | PSNR40.59 | 6 | |
| Tone Mapping | 12MP resolution (test) | TMQI-Q0.845 | 5 | |
| Denoising | DRIFT-MFP Denoising User Study 12-MP resolution bursts | User Preference Score63 | 2 |