Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

ZeroStereo: Zero-shot Stereo Matching from Single Images

About

State-of-the-art supervised stereo matching methods have achieved remarkable performance on various benchmarks. However, their generalization to real-world scenarios remains challenging due to the scarcity of annotated real-world stereo data. In this paper, we propose ZeroStereo, a novel stereo image generation pipeline for zero-shot stereo matching. Our approach synthesizes high-quality right images from arbitrary single images by leveraging pseudo disparities generated by a monocular depth estimation model. Unlike previous methods that address occluded regions by filling missing areas with neighboring pixels or random backgrounds, we fine-tune a diffusion inpainting model to recover missing details while preserving semantic structure. Additionally, we propose Training-Free Confidence Generation, which mitigates the impact of unreliable pseudo labels without additional training, and Adaptive Disparity Selection, which ensures a diverse and realistic disparity distribution while preventing excessive occlusion and foreground distortion. Experiments demonstrate that models trained with our pipeline achieve state-of-the-art zero-shot generalization across multiple datasets with only a dataset volume comparable to Scene Flow. Code: https://github.com/Windsrain/ZeroStereo.

Xianqi Wang, Hao Yang, Gangwei Xu, Junda Cheng, Min Lin, Yong Deng, Jinliang Zang, Yurui Chen, Xin Yang• 2025

Related benchmarks

TaskDatasetResultRank
Stereo MatchingDrivingStereo Zero-shot generalization
Error Rate (Sunny)3.15
15
World-centric geometry reconstructionDynamic Replica
delta^p0.929
13
2D-to-3D conversionDynamic Replica (val)
PSNR23.08
7
2D-to-3D conversionSVD AVP (test)
PSNR23.81
7
Stereo InpaintingHD-100 768×1280 (test)
PSNR24.73
7
Stereo View SynthesisDrivingstereo (full)
iSQoE79.64
5
Stereo View SynthesisBooster (test)
iSQoE75.03
5
Stereo View SynthesisLayeredFlow (test)
iSQoE0.8108
5
Stereo View SynthesisMiddlebury 2014 (full)
iSQoE0.7423
5
Showing 9 of 9 rows

Other info

Follow for update