Pip-Stereo: Progressive Iterations Pruner for Iterative Optimization based Stereo Matching

About

While iterative stereo matching achieves high accuracy, its dependence on Recurrent Neural Networks (RNNs) hinders edge deployment, a challenge underexplored in existing research. We analyze iterative refinement and reveal that disparity updates are spatially sparse and temporally redundant. First, we introduce a progressive iteration pruning strategy that suppresses redundant update steps, effectively collapsing the recursive computation into a near-single-pass inference. Second, we propose a collaborative monocular prior transfer framework that implicitly embeds depth priors without requiring a dedicated monocular encoder, thereby eliminating its associated computational burden. Third, we develop FlashGRU, a hardware-aware RNN operator leveraging structured sparsity and I/O-conscious design, achieving a 7.28× speedup, a 76.6% reduction in peak memory, and an 80.9% reduction in global memory requests over native ConvGRUs at 2K resolution. PipStereo enables real-time, high-fidelity stereo matching on edge hardware: it processes 320×640 frames in just 75 ms on an NVIDIA Jetson Orin NX (FP16) and 19 ms on an RTX 4090, matching the accuracy of large iterative models, and its generalization ability and accuracy far exceed those of existing real-time methods. Our embedded AI projects will be updated at: https://github.com/XPENG-Aridge-AI.
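To put the reported figures in more familiar terms, the short sketch below converts the stated per-frame latencies into frames per second and the stated reductions into remaining fractions. It assumes frames are processed strictly one at a time (no batching or pipelining), which the abstract does not specify; the helper names are illustrative and not part of the authors' code.

```python
def fps_from_latency_ms(latency_ms: float) -> float:
    """Frames per second, assuming one frame is processed at a time."""
    if latency_ms <= 0:
        raise ValueError("latency must be positive")
    return 1000.0 / latency_ms


def remaining_fraction(reduction_percent: float) -> float:
    """Fraction of the baseline that remains after a percentage reduction."""
    return 1.0 - reduction_percent / 100.0


# Latencies quoted in the abstract for 320x640 frames.
orin_fps = fps_from_latency_ms(75.0)   # NVIDIA Jetson Orin NX (FP16)
rtx_fps = fps_from_latency_ms(19.0)    # RTX 4090

# FlashGRU figures quoted in the abstract, relative to native ConvGRUs.
runtime_fraction = 1.0 / 7.28                    # 7.28x speedup
peak_memory_fraction = remaining_fraction(76.6)  # peak memory
global_requests_fraction = remaining_fraction(80.9)  # global memory requests

print(f"Orin NX: {orin_fps:.1f} FPS, RTX 4090: {rtx_fps:.1f} FPS")
print(f"FlashGRU runtime: {runtime_fraction:.1%} of baseline")
print(f"FlashGRU peak memory: {peak_memory_fraction:.1%} of baseline")
print(f"FlashGRU global memory requests: {global_requests_fraction:.1%} of baseline")
```

Under that single-frame assumption, 75 ms corresponds to roughly 13 frames per second and 19 ms to roughly 53 frames per second.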

Jintu Zheng, Qizhe Liu, HuangXin Xu, Zhuojie Chen • 2026

Related benchmarks

Task | Dataset | Result | Rank
Stereo Matching | KITTI 2015 | D1 Error (All): 1.44 | 118
Stereo Matching | KITTI 2012 | Error Rate (3px, Noc): 0.92 | 81
Stereo Matching | Scene Flow | EPE (px): 0.45 | 40
Stereo Matching | ETH3D | Threshold Error > 1px (All): 0.67 | 30
Stereo Matching | DrivingStereo Zero-shot generalization | Error Rate (Sunny): 3.27 | 15
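For readers unfamiliar with the metrics in the benchmark list, the sketch below shows how they are commonly computed from predicted and ground-truth disparities. It is a minimal illustration, not the benchmarks' official evaluation code: it assumes the inputs are flat lists that contain only valid ground-truth pixels, it uses the usual KITTI 2015 convention that a pixel is an outlier when its error exceeds both 3 px and 5% of the true disparity, and the sample values are made up.

```python
def end_point_error(pred, gt):
    """EPE: mean absolute disparity error in pixels."""
    return sum(abs(p - g) for p, g in zip(pred, gt)) / len(gt)


def bad_pixel_rate(pred, gt, threshold_px):
    """Percentage of pixels whose absolute error exceeds threshold_px."""
    bad = sum(1 for p, g in zip(pred, gt) if abs(p - g) > threshold_px)
    return 100.0 * bad / len(gt)


def d1_error(pred, gt, abs_px=3.0, rel=0.05):
    """D1: percentage of pixels with error > abs_px AND > rel * |gt|."""
    bad = sum(
        1
        for p, g in zip(pred, gt)
        if abs(p - g) > abs_px and abs(p - g) > rel * abs(g)
    )
    return 100.0 * bad / len(gt)


# Hypothetical disparities (in pixels) for four valid pixels.
pred = [10.0, 20.5, 34.0, 104.0]
gt = [10.0, 20.0, 30.0, 100.0]

epe = end_point_error(pred, gt)            # errors: 0, 0.5, 4, 4
bad_1px = bad_pixel_rate(pred, gt, 1.0)    # ETH3D-style > 1px rate
bad_3px = bad_pixel_rate(pred, gt, 3.0)    # KITTI 2012-style 3px rate
d1 = d1_error(pred, gt)                    # KITTI 2015-style D1

print(f"EPE = {epe} px, >1px = {bad_1px}%, >3px = {bad_3px}%, D1 = {d1}%")
```

In this toy input the last pixel is off by 4 px, so it counts toward the 3 px error rate, but 4 px is less than 5% of its true disparity of 100 px, so it is not a D1 outlier.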
