Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

UASTrack: A Unified Adaptive Selection Framework with Modality-Customization in Single Object Tracking

About

Multi-modal tracking is essential in single-object tracking (SOT), as different sensor types contribute unique capabilities to overcome challenges caused by variations in object appearance. However, existing unified RGB-X trackers (X represents depth, event, or thermal modality) either rely on the task-specific training strategy for individual RGB-X image pairs or fail to address the critical importance of modality-adaptive perception in real-world applications. In this work, we propose UASTrack, a unified adaptive selection framework that facilitates both model and parameter unification, as well as adaptive modality discrimination across various multi-modal tracking tasks. To achieve modality-adaptive perception in joint RGB-X pairs, we design a Discriminative Auto-Selector (DAS) capable of identifying modality labels, thereby distinguishing the data distributions of auxiliary modalities. Furthermore, we propose a Task-Customized Optimization Adapter (TCOA) tailored to various modalities in the latent space. This strategy effectively filters noise redundancy and mitigates background interference based on the specific characteristics of each modality. Extensive comparisons conducted on five benchmarks including LasHeR, GTOT, RGBT234, VisEvent, and DepthTrack, covering RGB-T, RGB-E, and RGB-D tracking scenarios, demonstrate our innovative approach achieves comparative performance by introducing only additional training parameters of 1.87M and flops of 1.95G. The code will be available at https://github.com/wanghe/UASTrack.

He Wang, Tianyang Xu, Zhangyong Tang, Xiao-Jun Wu, Josef Kittler• 2025

Related benchmarks

TaskDatasetResultRank
RGB-T TrackingGTOT
PR93.3
138
RGB-T TrackingRGBT234
Precision87.6
121
RGBT TrackingLasHeR
PR71.1
120
Visual Object TrackingDepthTrack
Recall0.625
91
Object TrackingVisEvent--
46
Object TrackingLasHeR (test)
SR57
11
Showing 6 of 6 rows

Other info

Follow for update