Restoration Adaptation for Semantic Segmentation on Low Quality Images
About
In real-world scenarios, the performance of semantic segmentation often deteriorates when processing low-quality (LQ) images, which may lack clear semantic structures and high-frequency details. Although image restoration techniques offer a promising direction for enhancing degraded visual content, conventional real-world image restoration (Real-IR) models primarily focus on pixel-level fidelity and often fail to recover task-relevant semantic cues, limiting their effectiveness when directly applied to downstream vision tasks. Conversely, existing segmentation models trained on high-quality data lack robustness under real-world degradations. In this paper, we propose Restoration Adaptation for Semantic Segmentation (RASS), which effectively integrates semantic image restoration into the segmentation process, enabling high-quality semantic segmentation on the LQ images directly. Specifically, we first propose a Semantic-Constrained Restoration (SCR) model, which injects segmentation priors into the restoration model by aligning its cross-attention maps with segmentation masks, encouraging semantically faithful image reconstruction. Then, RASS transfers semantic restoration knowledge into segmentation through LoRA-based module merging and task-specific fine-tuning, thereby enhancing the model's robustness to LQ images. To validate the effectiveness of our framework, we construct a real-world LQ image segmentation dataset with high-quality annotations, and conduct extensive experiments on both synthetic and real-world LQ benchmarks. The results show that SCR and RASS significantly outperform state-of-the-art methods in segmentation and restoration tasks. Code, models, and datasets will be available at https://github.com/Ka1Guan/RASS.git.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Super-Resolution | DIV2K | PSNR23.73 | 101 | |
| Image Super-resolution | DRealSR | MANIQA0.6689 | 78 | |
| Image Super-resolution | RealSR | PSNR25.34 | 71 | |
| Semantic segmentation | ADE20K degraded (val) | mIoU47.42 | 17 | |
| Semantic segmentation | RealLQ | mIoU39.8 | 17 | |
| Image Restoration | RealSR | PSNR25.34 | 12 | |
| Image Restoration | DIV2K | PSNR23.75 | 6 | |
| Semantic segmentation | ADE20K 150 standard (val) | mIoU53.76 | 5 | |
| Semantic segmentation | ADE20K Synthetic in-distribution (val) | mIoU47.42 | 5 | |
| Semantic segmentation | RealLQ RD Real-world Degradation (val) | mIoU39.8 | 5 |