Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Generalized Relation Modeling for Transformer Tracking

About

Compared with previous two-stream trackers, the recent one-stream tracking pipeline, which allows earlier interaction between the template and search region, has achieved a remarkable performance gain. However, existing one-stream trackers always let the template interact with all parts inside the search region throughout all the encoder layers. This could potentially lead to target-background confusion when the extracted feature representations are not sufficiently discriminative. To alleviate this issue, we propose a generalized relation modeling method based on adaptive token division. The proposed method is a generalized formulation of attention-based relation modeling for Transformer tracking, which inherits the merits of both previous two-stream and one-stream pipelines whilst enabling more flexible relation modeling by selecting appropriate search tokens to interact with template tokens. An attention masking strategy and the Gumbel-Softmax technique are introduced to facilitate the parallel computation and end-to-end learning of the token division module. Extensive experiments show that our method is superior to the two-stream and one-stream pipelines and achieves state-of-the-art performance on six challenging benchmarks with a real-time running speed.

Shenyuan Gao, Chunluan Zhou, Jun Zhang• 2023

Related benchmarks

TaskDatasetResultRank
Visual Object TrackingTrackingNet (test)
Normalized Precision (Pnorm)88.9
502
Object TrackingLaSoT
AUC71.4
498
Visual Object TrackingLaSOT (test)
AUC71.4
470
Visual Object TrackingGOT-10k (test)
Average Overlap73.4
450
Object TrackingTrackingNet
Precision (P)84
327
Visual Object TrackingGOT-10k
AO73.4
306
Visual Object TrackingUAV123 (test)
AUC70.2
188
Single Object TrackingTrackingNet
Pnorm88.9
72
Visual Object TrackingNFS (Need for Speed) 30 FPS (test)
AUC65.6
54
Visual Object TrackingGOT-10k 1.0 (test)
AO73.4
51
Showing 10 of 27 rows

Other info

Code

Follow for update