Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Visible-Thermal UAV Tracking: A Large-Scale Benchmark and New Baseline

About

With the popularity of multi-modal sensors, visible-thermal (RGB-T) object tracking is to achieve robust performance and wider application scenarios with the guidance of objects' temperature information. However, the lack of paired training samples is the main bottleneck for unlocking the power of RGB-T tracking. Since it is laborious to collect high-quality RGB-T sequences, recent benchmarks only provide test sequences. In this paper, we construct a large-scale benchmark with high diversity for visible-thermal UAV tracking (VTUAV), including 500 sequences with 1.7 million high-resolution (1920 $\times$ 1080 pixels) frame pairs. In addition, comprehensive applications (short-term tracking, long-term tracking and segmentation mask prediction) with diverse categories and scenes are considered for exhaustive evaluation. Moreover, we provide a coarse-to-fine attribute annotation, where frame-level attributes are provided to exploit the potential of challenge-specific trackers. In addition, we design a new RGB-T baseline, named Hierarchical Multi-modal Fusion Tracker (HMFT), which fuses RGB-T data in various levels. Numerous experiments on several datasets are conducted to reveal the effectiveness of HMFT and the complement of different fusion types. The project is available at here.

Pengyu Zhang, Jie Zhao, Dong Wang, Huchuan Lu, Xiang Ruan• 2022

Related benchmarks

TaskDatasetResultRank
RGB-T TrackingLasHeR (test)
PR46
244
RGB-T TrackingRGBT234 (test)
Precision Rate79.6
189
RGB-T TrackingGTOT
PR91.2
114
RGB-T TrackingRGBT234
Precision78.8
98
RGBT TrackingRGBT234
PR78.8
65
RGBT TrackingRGBT-210
Precision Rate78.6
54
RGBT TrackingRGBT 234
Precision Rate78.8
53
RGB-T TrackingRGBT210 (test)
PR78.6
32
RGBT TrackingVTUAV
Precision Rate75.8
21
RGB-T TrackingGTOT (test)
PR91.2
19
Showing 10 of 19 rows

Other info

Follow for update