Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

TransFlow: Transformer as Flow Learner

About

Optical flow is an indispensable building block for various important computer vision tasks, including motion estimation, object tracking, and disparity measurement. In this work, we propose TransFlow, a pure transformer architecture for optical flow estimation. Compared to dominant CNN-based methods, TransFlow demonstrates three advantages. First, it provides more accurate correlation and trustworthy matching in flow estimation by utilizing spatial self-attention and cross-attention mechanisms between adjacent frames to effectively capture global dependencies; Second, it recovers more compromised information (e.g., occlusion and motion blur) in flow estimation through long-range temporal association in dynamic scenes; Third, it enables a concise self-learning paradigm and effectively eliminate the complex and laborious multi-stage pre-training procedures. We achieve the state-of-the-art results on the Sintel, KITTI-15, as well as several downstream tasks, including video object detection, interpolation and stabilization. For its efficacy, we hope TransFlow could serve as a flexible baseline for optical flow estimation.

Yawen Lu, Qifan Wang, Siqi Ma, Tong Geng, Yingjie Victor Chen, Huaijin Chen, Dongfang Liu• 2023

Related benchmarks

TaskDatasetResultRank
Optical Flow EstimationKITTI 2015 (train)
Fl-epe0.49
446
Optical FlowSintel (train)
AEPE (Clean)0.42
200
Optical Flow EstimationSintel Final (test)
EPE2.33
133
Optical FlowSintel (test)
AEPE (Final)2.08
120
Optical Flow EstimationSintel clean (test)
EPE1.06
120
Optical Flow EstimationKITTI 2015
Fl-all4.32
60
Optical Flow EstimationKITTI-15 (test)
Fl-all Error4.32
53
Optical Flow EstimationMPI Sintel Final Pass
Overall AEE2.08
29
Optical Flow EstimationSintel generalization Clean
EPE0.93
12
Optical Flow EstimationKITTI-15 generalization (test)
Fl-epe3.98
12
Showing 10 of 10 rows

Other info

Follow for update