Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Generalized Relation Modeling for Transformer Tracking

About

Compared with previous two-stream trackers, the recent one-stream tracking pipeline, which allows earlier interaction between the template and search region, has achieved a remarkable performance gain. However, existing one-stream trackers always let the template interact with all parts inside the search region throughout all the encoder layers. This could potentially lead to target-background confusion when the extracted feature representations are not sufficiently discriminative. To alleviate this issue, we propose a generalized relation modeling method based on adaptive token division. The proposed method is a generalized formulation of attention-based relation modeling for Transformer tracking, which inherits the merits of both previous two-stream and one-stream pipelines whilst enabling more flexible relation modeling by selecting appropriate search tokens to interact with template tokens. An attention masking strategy and the Gumbel-Softmax technique are introduced to facilitate the parallel computation and end-to-end learning of the token division module. Extensive experiments show that our method is superior to the two-stream and one-stream pipelines and achieves state-of-the-art performance on six challenging benchmarks with a real-time running speed.

Shenyuan Gao, Chunluan Zhou, Jun Zhang• 2023

Related benchmarks

TaskDatasetResultRank
Visual Object TrackingTrackingNet (test)
Normalized Precision (Pnorm)88.9
463
Visual Object TrackingLaSOT (test)
AUC71.4
446
Object TrackingLaSoT
AUC71.4
411
Visual Object TrackingGOT-10k (test)
Average Overlap73.4
408
Object TrackingTrackingNet
Precision (P)84
270
Visual Object TrackingGOT-10k
AO73.4
254
Visual Object TrackingUAV123 (test)
AUC70.2
188
Visual Object TrackingNFS (Need for Speed) 30 FPS (test)
AUC65.6
54
Single Object TrackingTrackingNet
Pnorm88.9
52
Visual Object TrackingGOT-10k 1.0 (test)
AO73.4
51
Showing 10 of 24 rows

Other info

Code

Follow for update