Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

UniFixer: A Universal Reference-Guided Fixer for Diffusion-Based View Synthesis

About

With the recent surge of generative models, diffusion-based approaches have become mainstream for view synthesis tasks, either in an explicit depth-warp-inpaint or in an implicit end-to-end manner. Despite their success, both paradigms often suffer from noticeable quality degradation, e.g., blurred details and distorted structures, caused by pixel-to-latent compression and diffusion hallucination. In this paper, we investigate diffusion degradation from three key dimensions (i.e., spatial, temporal, and backbone-related) and propose UniFixer, a universal reference-guided framework that fixes diverse degradation artifacts via a coarse-to-fine strategy. Specifically, a reference pre-alignment module is first designed to perform coarse alignment between the reference view and the degraded novel view. A global structure anchoring mechanism then rectifies geometric distortions to ensure structural fidelity, followed by a local detail injection module that recovers fine-grained texture details for high-quality view synthesis. Our UniFixer serves as a plug-and-play refiner that achieves zero-shot fixing across different types of diffusion degradation, and extensive experiments verify our state-of-the-art performance on novel view synthesis and stereo conversion.

Sihan Chen, Xiang Zhang, Yang Zhang, Tunc Aydin, Christopher Schroers• 2026

Related benchmarks

TaskDatasetResultRank
Novel View SynthesisDL3DV
PSNR24.32
40
Stereo ConversionMono2Stereo
PSNR30.84
14
Stereo ConversionSpring
PSNR29.35
8
Novel View SynthesisSpring
CLIP-IQA0.482
4
Novel View SynthesisSVD
CLIP-IQA0.531
4
Stereo ConversionSpring
CLIP-IQA0.452
4
Stereo ConversionSVD
CLIP-IQA Score53.8
4
Showing 7 of 7 rows

Other info

Follow for update