Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

DenVisCoM: Dense Vision Correspondence Mamba for Efficient and Real-time Optical Flow and Stereo Estimation

About

In this work, we propose a novel Mamba block DenVisCoM, as well as a novel hybrid architecture specifically tailored for accurate and real-time estimation of optical flow and disparity estimation. Given that such multi-view geometry and motion tasks are fundamentally related, we propose a unified architecture to tackle them jointly. Specifically, the proposed hybrid architecture is based on DenVisCoM and a Transformer-based attention block that efficiently addresses real-time inference, memory footprint, and accuracy at the same time for joint estimation of motion and 3D dense perception tasks. We extensively analyze the benchmark trade-off of accuracy and real-time processing on a large number of datasets. Our experimental results and related analysis suggest that our proposed model can accurately estimate optical flow and disparity estimation in real time. All models and associated code are available at https://github.com/vimstereo/DenVisCoM.

Tushar Anand, Maheswar Bora, Antitza Dantcheva, Abhijit Das• 2026

Related benchmarks

TaskDatasetResultRank
Optical Flow EstimationSintel Final (test)--
101
Optical FlowKITTI 2015 (test)--
95
Optical FlowSintel clean (test)
AEE (Unmatched)7.903
37
Disparity EstimationKITTI 15
EPE0.27
11
Disparity EstimationvKITTI 2
EPE0.18
11
Disparity EstimationSintel
EPE0.51
11
Showing 6 of 6 rows

Other info

Follow for update