Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Deformable Siamese Attention Networks for Visual Object Tracking

About

Siamese-based trackers have achieved excellent performance on visual object tracking. However, the target template is not updated online, and the features of the target template and search image are computed independently in a Siamese architecture. In this paper, we propose Deformable Siamese Attention Networks, referred to as SiamAttn, by introducing a new Siamese attention mechanism that computes deformable self-attention and cross-attention. The self attention learns strong context information via spatial attention, and selectively emphasizes interdependent channel-wise features with channel attention. The cross-attention is capable of aggregating rich contextual inter-dependencies between the target template and the search image, providing an implicit manner to adaptively update the target template. In addition, we design a region refinement module that computes depth-wise cross correlations between the attentional features for more accurate tracking. We conduct experiments on six benchmarks, where our method achieves new state of-the-art results, outperforming the strong baseline, SiamRPN++ [24], by 0.464->0.537 and 0.415->0.470 EAO on VOT 2016 and 2018. Our code is available at: https://github.com/msight-tech/research-siamattn.

Yuechen Yu, Yilei Xiong, Weilin Huang, Matthew R. Scott• 2020

Related benchmarks

TaskDatasetResultRank
Video Object SegmentationDAVIS 2017 (val)
J mean79.2
1130
Visual Object TrackingTrackingNet (test)
Normalized Precision (Pnorm)81.7
460
Visual Object TrackingLaSOT (test)
AUC56
444
Video Object SegmentationYouTube-VOS 2019 (val)
J-Score (Seen)79.6
231
Visual Object TrackingUAV123 (test)
AUC65
188
Visual Object TrackingUAV123
AUC0.65
165
Visual Object TrackingOTB-100
AUC71.2
136
Visual Object TrackingNfS
AUC0.639
112
Visual Object TrackingVOT 2018 (test)
EAO0.47
54
Visual Object TrackingOTB 2015 (test)
AUC Score71.2
47
Showing 10 of 17 rows

Other info

Follow for update