Task-Driven Super Resolution: Object Detection in Low-resolution Images
About
We consider how image super resolution (SR) can contribute to an object detection task in low-resolution images. Intuitively, SR gives a positive impact on the object detection task. While several previous works demonstrated that this intuition is correct, SR and detector are optimized independently in these works. This paper proposes a novel framework to train a deep neural network where the SR sub-network explicitly incorporates a detection loss in its training objective, via a tradeoff with a traditional detection loss. This end-to-end training procedure allows us to train SR preprocessing for any differentiable detector. We demonstrate that our task-driven SR consistently and significantly improves accuracy of an object detector on low-resolution images for a variety of conditions and scaling factors.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| 3D human shape and pose estimation | 3DPW | MPJPE-PA66.62 | 30 | |
| 3D human shape and pose estimation | MPI-INF-3DHP | MPJPE-PA72.13 | 29 | |
| Image Classification | CUB-200 2011 x4 (test) | Top-1 Accuracy0.721 | 10 | |
| Image Classification | StanfordCars x4 (test) | Top-1 Accuracy82 | 10 | |
| Image Classification | CUB-200 x8 2011 (test) | Top-1 Accuracy58.7 | 10 | |
| Object Detection | PASCAL VOC x4 scale 2012 (test) | mAP29.7 | 10 | |
| Image Classification | StanfordCars x8 (test) | Top-1 Accuracy0.61 | 10 | |
| Object Detection | PASCAL VOC x8 scale 2012 (test) | mAP18.9 | 10 | |
| Semantic segmentation | PASCAL VOC 2012 x4 (test) | mIoU56.6 | 10 | |
| Semantic segmentation | PASCAL VOC 2012 x8 (test) | mIoU49.2 | 10 |