Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

A Twofold Siamese Network for Real-Time Object Tracking

About

Observing that Semantic features learned in an image classification task and Appearance features learned in a similarity matching task complement each other, we build a twofold Siamese network, named SA-Siam, for real-time object tracking. SA-Siam is composed of a semantic branch and an appearance branch. Each branch is a similarity-learning Siamese network. An important design choice in SA-Siam is to separately train the two branches to keep the heterogeneity of the two types of features. In addition, we propose a channel attention mechanism for the semantic branch. Channel-wise weights are computed according to the channel activations around the target position. While the inherited architecture from SiamFC \cite{SiamFC} allows our tracker to operate beyond real-time, the twofold design and the attention mechanism significantly improve the tracking performance. The proposed SA-Siam outperforms all other real-time trackers by a large margin on OTB-2013/50/100 benchmarks.

Anfeng He, Chong Luo, Xinmei Tian, Wenjun Zeng• 2018

Related benchmarks

TaskDatasetResultRank
Visual Object TrackingOTB-100
AUC65.7
136
Visual Object TrackingVOT 2016
EAO29.1
79
Visual TrackingVOT 2016 (test)
EAO0.2911
70
Object TrackingOTB 2015 (test)
AUC0.656
63
Visual Object TrackingVOT 2015
EAO0.31
61
Visual Object TrackingOTB 2013
AUC67.7
60
Visual Object TrackingOTB 2015
AUC65.7
58
Visual Object TrackingOTB100 (test)
AUC0.657
41
Visual TrackingVOT 2015 (test)
Accuracy59
20
Visual Object TrackingOTB-50
AUC0.61
20
Showing 10 of 20 rows

Other info

Follow for update