
Robust Object Modeling for Visual Tracking

About

Object modeling has become a core part of recent tracking frameworks. Current popular trackers use Transformer attention to extract the template feature separately or interactively with the search region. However, separate template learning lacks communication between the template and search regions, which brings difficulty in extracting discriminative target-oriented features. On the other hand, interactive template learning produces hybrid template features, which may introduce potential distractors to the template via the cluttered search regions. To enjoy the merits of both methods, we propose a robust object modeling framework for visual tracking (ROMTrack), which simultaneously models the inherent template and the hybrid template features. As a result, harmful distractors can be suppressed by combining the inherent features of target objects with search regions' guidance. Target-related features can also be extracted using the hybrid template, thus resulting in a more robust object modeling framework. To further enhance robustness, we present novel variation tokens to depict the ever-changing appearance of target objects. Variation tokens are adaptable to object deformation and appearance variations, which can boost overall performance with negligible computation. Experiments show that our ROMTrack sets a new state-of-the-art on multiple benchmarks.
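The contrast the abstract draws between an inherent template (isolated from the search region) and a hybrid template (interacting with it) can be illustrated with an attention mask. The following is a minimal sketch and not the authors' implementation: it assumes a single attention head with no learned projections (queries, keys, and values are the raw tokens), a token order of inherent template, hybrid template, search region, and hypothetical helper names `build_mask` and `masked_attention`. Variation tokens are not modeled here.

```python
import math

def build_mask(n_inherent, n_hybrid, n_search):
    """mask[i][j] is True when query token i may attend to key token j.

    Assumed token order: [inherent template | hybrid template | search region].
    Inherent-template rows see only inherent-template columns; every other
    row sees all tokens.
    """
    n = n_inherent + n_hybrid + n_search
    mask = [[True] * n for _ in range(n)]
    for i in range(n_inherent):
        for j in range(n_inherent, n):
            mask[i][j] = False
    return mask

def masked_attention(tokens, mask):
    """Single-head scaled dot-product attention with q = k = v = tokens."""
    dim = len(tokens[0])
    out = []
    for i, q in enumerate(tokens):
        allowed = [j for j in range(len(tokens)) if mask[i][j]]
        scores = [
            sum(a * b for a, b in zip(q, tokens[j])) / math.sqrt(dim)
            for j in allowed
        ]
        top = max(scores)  # subtract the max for a numerically stable softmax
        weights = [math.exp(s - top) for s in scores]
        total = sum(weights)
        out.append([
            sum(w * tokens[j][d] for w, j in zip(weights, allowed)) / total
            for d in range(dim)
        ])
    return out

# Toy input: 1 inherent token, 1 hybrid token, 2 search tokens (2-D features).
tokens = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5], [0.2, 0.9]]
mask = build_mask(1, 1, 2)
clean = masked_attention(tokens, mask)

# Replace the search region with "cluttered" content.
cluttered = masked_attention(tokens[:2] + [[5.0, -3.0], [-2.0, 4.0]], mask)

print(clean[0] == cluttered[0])  # inherent template output is unaffected
print(clean[1] == cluttered[1])  # hybrid template output changes
```

In this toy setting the inherent template output does not depend on the search region, while the hybrid template output does, which mirrors the trade-off described above: isolation from distractors on one side, search-region guidance on the other.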

Yidong Cai, Jie Liu, Jie Tang, Gangshan Wu • 2023

Related benchmarks

Task                    Dataset               Result                           Rank
Visual Object Tracking  TrackingNet (test)    Normalized Precision (Pnorm) 89  460
Visual Object Tracking  LaSOT (test)          AUC 71.4                         444
Visual Object Tracking  GOT-10k (test)        Average Overlap 74.2             378
Object Tracking         LaSoT                 AUC 71.4                         333
Object Tracking         TrackingNet           Precision (P) 83.7               225
Visual Object Tracking  GOT-10k               AO 74.2                          223
Visual Object Tracking  UAV123 (test)         AUC 70.5                         188
Visual Object Tracking  LaSoText              Precision 58.6                   88
Visual Object Tracking  LaSOText (test)       AUC 51.3                         85
Visual Object Tracking  GOT-10k 1.0 (test)    AO 72.9                          51
Showing 10 of 34 rows
