Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Siamese Box Adaptive Network for Visual Tracking

About

Most of the existing trackers usually rely on either a multi-scale searching scheme or pre-defined anchor boxes to accurately estimate the scale and aspect ratio of a target. Unfortunately, they typically call for tedious and heuristic configurations. To address this issue, we propose a simple yet effective visual tracking framework (named Siamese Box Adaptive Network, SiamBAN) by exploiting the expressive power of the fully convolutional network (FCN). SiamBAN views the visual tracking problem as a parallel classification and regression problem, and thus directly classifies objects and regresses their bounding boxes in a unified FCN. The no-prior box design avoids hyper-parameters associated with the candidate boxes, making SiamBAN more flexible and general. Extensive experiments on visual tracking benchmarks including VOT2018, VOT2019, OTB100, NFS, UAV123, and LaSOT demonstrate that SiamBAN achieves state-of-the-art performance and runs at 40 FPS, confirming its effectiveness and efficiency. The code will be available at https://github.com/hqucv/siamban.

Zedu Chen, Bineng Zhong, Guorong Li, Shengping Zhang, Rongrong Ji• 2020

Related benchmarks

TaskDatasetResultRank
Visual Object TrackingTrackingNet (test)
Normalized Precision (Pnorm)79.5
460
Visual Object TrackingLaSOT (test)
AUC51.4
444
Object TrackingLaSoT
AUC51.4
333
Visual Object TrackingUAV123 (test)
AUC63.1
188
Visual Object TrackingUAV123
AUC0.631
165
Visual Object TrackingOTB-100
AUC69.6
136
Visual Object TrackingNfS
AUC0.594
112
Visual Object TrackingTNL2K--
95
Object TrackingVisEvent (test)
PR59.1
63
Visual Object TrackingVOT 2018 (test)
EAO0.452
54
Showing 10 of 35 rows

Other info

Follow for update