EvoIR-Agent: Self-Evolving Image Restoration Agentic System via Experience-Driven Learning
About
Multimodal Large Language Model (MLLM)-driven image restoration agent demonstrates effectiveness in degradation coupling scenarios by flexibly selecting tools and determining removal orders. However, their zero-shot planning often fails without experience, necessitating severe trial-and-error overhead to achieve satisfactory outcomes. Currently, two paradigms are employed to address this issue, yet a dilemma persists: Training-based methods embed intrinsic experience into parameters, achieving high inference efficiency but lacking compatibility with new tools or degradation. In contrast, training-free methods utilize explicit experience storage for compatibility but still incur trial-and-error overhead due to naive experience. To resolve the dilemma, we propose EvoIR-Agent, which first systematically formulates the experience components of a training-free image restoration agent. Subsequently, a hierarchical experience pool is constructed, which enables coarse-to-fine guidance for diverse tools and removal orders. Furthermore, a self-evolving mechanism is introduced to update the pool from scratch using accumulated records, thereby greatly improving performance and efficiency. Extensive experiments reveal that EvoIR-Agent achieves a significant lead in the full reference metrics and yields a remarkable Pareto-optimal balance between performance and efficiency compared to the state-of-the-art methods.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Image Restoration | MiO100 AgenticIR setting (Group A) | PSNR23.86 | 47 | |
| Image Restoration | MiO100 AgenticIR setting (Group B) | PSNR23.41 | 47 | |
| Image Restoration | MiO100 Group C (test) | PSNR22.12 | 12 | |
| Image Restoration | FoundIR (Low Light + Noise) | PSNR17.81 | 3 | |
| Image Restoration | FoundIR Blur + JPEG (B+J) | PSNR22.77 | 3 |