Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

GMFlow: Learning Optical Flow via Global Matching

About

Learning-based optical flow estimation has been dominated with the pipeline of cost volume with convolutions for flow regression, which is inherently limited to local correlations and thus is hard to address the long-standing challenge of large displacements. To alleviate this, the state-of-the-art framework RAFT gradually improves its prediction quality by using a large number of iterative refinements, achieving remarkable performance but introducing linearly increasing inference time. To enable both high accuracy and efficiency, we completely revamp the dominant flow regression pipeline by reformulating optical flow as a global matching problem, which identifies the correspondences by directly comparing feature similarities. Specifically, we propose a GMFlow framework, which consists of three main components: a customized Transformer for feature enhancement, a correlation and softmax layer for global feature matching, and a self-attention layer for flow propagation. We further introduce a refinement step that reuses GMFlow at higher feature resolution for residual flow prediction. Our new framework outperforms 31-refinements RAFT on the challenging Sintel benchmark, while using only one refinement and running faster, suggesting a new paradigm for accurate and efficient optical flow estimation. Code is available at https://github.com/haofeixu/gmflow.

Haofei Xu, Jing Zhang, Jianfei Cai, Hamid Rezatofighi, Dacheng Tao• 2021

Related benchmarks

TaskDatasetResultRank
Optical Flow EstimationKITTI 2015 (train)
Fl-epe7.77
446
Optical FlowSintel (train)
AEPE (Clean)1.08
200
Optical Flow EstimationSintel Final (test)
EPE2.48
133
Optical Flow EstimationSintel clean (test)
EPE1.74
120
Optical FlowSintel (test)
AEPE (Final)2.9
120
Optical FlowKITTI 2015 (test)--
109
Optical Flow EstimationKITTI 2015 (test)
Fl-all9.32
108
Optical FlowKITTI-15 (test)
Fl-all9.32
85
Optical FlowKITTI (train)
Fl-all0.287
84
Action RecognitionDiving-48 (test)
Top-1 Acc71.74
81
Showing 10 of 28 rows

Other info

Code

Follow for update