Staple: Complementary Learners for Real-Time Tracking
About
Correlation Filter-based trackers have recently achieved excellent performance, showing great robustness to challenging situations exhibiting motion blur and illumination changes. However, since the model that they learn depends strongly on the spatial layout of the tracked object, they are notoriously sensitive to deformation. Models based on colour statistics have complementary traits: they cope well with variation in shape, but suffer when illumination is not consistent throughout a sequence. Moreover, colour distributions alone can be insufficiently discriminative. In this paper, we show that a simple tracker combining complementary cues in a ridge regression framework can operate faster than 80 FPS and outperform not only all entries in the popular VOT14 competition, but also recent and far more sophisticated trackers according to multiple benchmarks.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Visual Object Tracking | TrackingNet (test) | Normalized Precision (Pnorm)60.3 | 460 | |
| Visual Object Tracking | LaSOT (test) | AUC24 | 444 | |
| Visual Object Tracking | UAV123 (test) | AUC45 | 188 | |
| Visual Object Tracking | UAV123 | AUC0.45 | 165 | |
| Visual Object Tracking | NfS | AUC0.41 | 112 | |
| Visual Object Tracking | VOT 2016 | EAO30 | 79 | |
| Visual Tracking | VOT 2016 (test) | EAO0.2952 | 70 | |
| Object Tracking | OTB 2015 (test) | AUC0.6 | 63 | |
| Visual Object Tracking | VOT 2015 | EAO0.3 | 61 | |
| Visual Object Tracking | OTB 2013 | AUC60 | 60 |