Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Siamese Natural Language Tracker: Tracking by Natural Language Descriptions with Siamese Trackers

About

We propose a novel Siamese Natural Language Tracker (SNLT), which brings the advancements in visual tracking to the tracking by natural language (NL) descriptions task. The proposed SNLT is applicable to a wide range of Siamese trackers, providing a new class of baselines for the tracking by NL task and promising future improvements from the advancements of Siamese trackers. The carefully designed architecture of the Siamese Natural Language Region Proposal Network (SNL-RPN), together with the Dynamic Aggregation of vision and language modalities, is introduced to perform the tracking by NL task. Empirical results over tracking benchmarks with NL annotations show that the proposed SNLT improves Siamese trackers by 3 to 7 percentage points with a slight tradeoff of speed. The proposed SNLT outperforms all NL trackers to-date and is competitive among state-of-the-art real-time trackers on LaSOT benchmarks while running at 50 frames per second on a single GPU.

Qi Feng, Vitaly Ablavsky, Qinxun Bai, Stan Sclaroff• 2019

Related benchmarks

TaskDatasetResultRank
Object TrackingLaSoT
AUC54
333
Visual Object TrackingTNL2K
AUC27.6
95
Visual Object TrackingTNL2k (test)
AUC27.6
74
Vision-Language TrackingOTB 99
AUC67
70
Vision-Language TrackingTNL2k (test)
AUC27.6
49
Visual Object TrackingOTB99 (test)
AUC67
29
Visual Object TrackingOTB Lang
Success Rate67
20
Natural Language TrackingTNL-2K
AUC27.6
19
Vision-Language TrackingLaSOT ext
AUC0.262
18
Natural Language TrackingOTB Lang
AUC67
17
Showing 10 of 12 rows

Other info

Follow for update