Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

FlowIt: Global Matching for Optical Flow with Confidence-Guided Refinement

About

We present FlowIt, a novel architecture for optical flow estimation designed to robustly handle large pixel displacements. At its core, FlowIt leverages a hierarchical transformer architecture that captures extensive global context, enabling the model to effectively model long-range correspondences. To overcome the limitations of localized matching, we formulate the flow initialization as an optimal transport problem. This formulation yields a highly robust initial flow field, alongside explicitly derived occlusion and confidence maps. These cues are then seamlessly integrated into a guided refinement stage, where the network actively propagates reliable motion estimates from high-confidence regions into ambiguous, low-confidence areas. Extensive experiments across the Sintel, KITTI, Spring, and LayeredFlow datasets validate the efficacy of our approach. FlowIt achieves state-of-the-art results on the competitive Sintel and KITTI benchmarks, while simultaneously establishing new state-of-the-art cross-dataset zero-shot generalization performance on Sintel, Spring, and LayeredFlow.

Sadra Safadoust, Fabio Tosi, Matteo Poggi, Fatma G\"uney• 2026

Related benchmarks

TaskDatasetResultRank
Optical Flow EstimationKITTI 2015
Fl-all3.81
60
Optical FlowSintel Clean
EPE0.93
59
Optical FlowSintel Final
EPE2.29
59
Showing 3 of 3 rows

Other info

Follow for update