Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Learning Spatial-Aware Regressions for Visual Tracking

About

In this paper, we analyze the spatial information of deep features, and propose two complementary regressions for robust visual tracking. First, we propose a kernelized ridge regression model wherein the kernel value is defined as the weighted sum of similarity scores of all pairs of patches between two samples. We show that this model can be formulated as a neural network and thus can be efficiently solved. Second, we propose a fully convolutional neural network with spatially regularized kernels, through which the filter kernel corresponding to each output channel is forced to focus on a specific region of the target. Distance transform pooling is further exploited to determine the effectiveness of each output channel of the convolution layer. The outputs from the kernelized ridge regression model and the fully convolutional neural network are combined to obtain the ultimate response. Experimental results on two benchmark datasets validate the effectiveness of the proposed method.

Chong Sun, Dong Wang, Huchuan Lu, Ming-Hsuan Yang• 2017

Related benchmarks

TaskDatasetResultRank
Visual Object TrackingVOT 2016
EAO32.3
79
Visual Object TrackingVOT 2015
EAO0.324
61
Visual Object TrackingOTB 2015
AUC67.2
58
Visual Object TrackingOTB-100
AUC67.2
21
Short-Term TrackingVOT 2017 2018
EAO32.3
19
Object TrackingVOT 2018
EAO0.323
19
Object TrackingVOT 2017 (test)
EAO0.055
19
Visual Object TrackingOTB 2013
AUC67.7
17
Visual Object TrackingVOT 2017 6.0.3 (test)
EAO0.323
14
Visual Object TrackingVOT 2017 2018 (test)
EAO0.323
9
Showing 10 of 11 rows

Other info

Code

Follow for update