VQ-Jarvis: Retrieval-Augmented Video Restoration Agent with Sharp Vision and Fast Thought
About
Video restoration in real-world scenarios is challenged by heterogeneous degradations, where static architectures and fixed inference pipelines often fail to generalize. Recent agent-based approaches offer dynamic decision making, yet existing video restoration agents remain limited by insufficient quality perception and inefficient search strategies. We propose VQ-Jarvis, a retrieval-augmented, all-in-one intelligent video restoration agent with sharper vision and faster thought. VQ-Jarvis is designed to accurately perceive degradations and subtle differences among paired restoration results, while efficiently discovering optimal restoration trajectories. To enable sharp vision, we construct VSR-Compare, the first large-scale video paired enhancement dataset with 20K comparison pairs covering 7 degradation types, 11 enhancement operators, and diverse content domains. Based on this dataset, we train a multiple operator judge model and a degradation perception model to guide agent decisions. To achieve fast thought, we introduce a hierarchical operator scheduling strategy that adapts to video difficulty: for easy cases, optimal restoration trajectories are retrieved in a one-step manner from a retrieval-augmented generation (RAG) library; for harder cases, a step-by-step greedy search is performed to balance efficiency and accuracy. Extensive experiments demonstrate that VQ-Jarvis consistently outperforms existing methods on complex degraded videos.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Video Quality Assessment | KoNViD-1k | SROCC0.891 | 183 | |
| Video Quality Assessment | LIVE-VQC | SRCC0.82 | 111 | |
| Video Quality Assessment | LSVQ (test) | SRCC0.893 | 84 | |
| Video Quality Assessment | LSVQ 1080p | SRCC0.809 | 78 | |
| Video Quality Assessment | FineVQ | Color PLCC0.8937 | 10 | |
| Video Restoration | YouHQ40 | LPIPS0.312 | 6 | |
| Video Restoration | UDM10 | LPIPS0.288 | 6 | |
| Video Restoration | Constructed Benchmark (Group 1) | PSNR18.7 | 5 | |
| Video Restoration | Constructed Benchmark (Group 3) | PSNR16.03 | 5 | |
| Video Restoration | Constructed Benchmark (Group 2) | PSNR22.52 | 5 |