
AiATrack: Attention in Attention for Transformer Visual Tracking

About

Transformer trackers have achieved impressive advancements recently, where the attention mechanism plays an important role. However, the independent correlation computation in the attention mechanism could result in noisy and ambiguous attention weights, which inhibits further performance improvement. To address this issue, we propose an attention in attention (AiA) module, which enhances appropriate correlations and suppresses erroneous ones by seeking consensus among all correlation vectors. Our AiA module can be readily applied to both self-attention blocks and cross-attention blocks to facilitate feature aggregation and information propagation for visual tracking. Moreover, we propose a streamlined Transformer tracking framework, dubbed AiATrack, by introducing efficient feature reuse and target-background embeddings to make full use of temporal references. Experiments show that our tracker achieves state-of-the-art performance on six tracking benchmarks while running at a real-time speed.
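To make the idea concrete, here is a minimal NumPy sketch of "attention in attention": a standard attention correlation map is refined by a second, inner attention that lets correlation vectors reach consensus before values are aggregated. This is an illustrative toy, not the authors' implementation — the projection weights are random placeholders, and the real AiA module in AiATrack uses learned projections inside both self- and cross-attention blocks.

```python
import numpy as np

def softmax(x, axis=-1):
    """Numerically stable softmax."""
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def aia_attention(q, k, v, d_inner=16, seed=0):
    """Toy attention-in-attention (illustrative only).

    1. Compute the raw correlation map q @ k^T, as in vanilla attention.
    2. Run an inner attention among the correlation vectors so that each
       one is adjusted toward the consensus of the others (enhancing
       consistent correlations, suppressing outliers).
    3. Softmax the refined map and aggregate the values.

    The inner projection weights (wq, wk) are random stand-ins for
    learned parameters -- an assumption for this sketch.
    """
    rng = np.random.default_rng(seed)
    d = q.shape[-1]
    corr = q @ k.T / np.sqrt(d)                      # raw correlations, shape (Nq, Nk)

    # Inner attention: treat each correlation vector (row of corr) as a token.
    nk = corr.shape[1]
    wq = rng.standard_normal((nk, d_inner)) / np.sqrt(nk)
    wk = rng.standard_normal((nk, d_inner)) / np.sqrt(nk)
    inner = softmax((corr @ wq) @ (corr @ wk).T / np.sqrt(d_inner), axis=-1)

    corr = corr + inner @ corr                       # consensus-refined correlations
    return softmax(corr, axis=-1) @ v                # aggregated output, shape (Nq, dv)
```

The residual form (`corr + inner @ corr`) keeps the original correlations and adds the consensus correction on top, so the inner attention can only modulate, not replace, the outer one.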

Shenyuan Gao, Chunluan Zhou, Chao Ma, Xinggang Wang, Junsong Yuan • 2022

Related benchmarks

| Task | Dataset | Metric | Result | Rank |
|---|---|---|---|---|
| Visual Object Tracking | TrackingNet (test) | Normalized Precision (Pnorm) | 87.8 | 460 |
| Visual Object Tracking | LaSOT (test) | AUC | 69 | 444 |
| Visual Object Tracking | GOT-10k (test) | Average Overlap | 69.6 | 378 |
| Object Tracking | LaSOT | AUC | 69.6 | 333 |
| RGB-T Tracking | LasHeR (test) | PR | 46.3 | 244 |
| Object Tracking | TrackingNet | Precision (P) | 80.4 | 225 |
| Visual Object Tracking | GOT-10k | AO | 69.6 | 223 |
| Visual Object Tracking | UAV123 (test) | AUC | 70.6 | 188 |
| RGB-D Object Tracking | VOT-RGBD 2022 (public challenge) | EAO | 0.641 | 167 |
| Visual Object Tracking | VOT 2020 (test) | EAO | 0.53 | 147 |
Showing 10 of 33 benchmark results.

Other info

Code
