SiamCAR: Siamese Fully Convolutional Classification and Regression for Visual Tracking

About

By decomposing the visual tracking task into two subproblems as classification for pixel category and regression for object bounding box at this pixel, we propose a novel fully convolutional Siamese network to solve visual tracking end-to-end in a per-pixel manner. The proposed framework SiamCAR consists of two simple subnetworks: one Siamese subnetwork for feature extraction and one classification-regression subnetwork for bounding box prediction. Our framework takes ResNet-50 as backbone. Different from state-of-the-art trackers like Siamese-RPN, SiamRPN++ and SPM, which are based on region proposal, the proposed framework is both proposal and anchor free. Consequently, we are able to avoid the tricky hyper-parameter tuning of anchors and reduce human intervention. The proposed framework is simple, neat and effective. Extensive experiments and comparisons with state-of-the-art trackers are conducted on many challenging benchmarks like GOT-10K, LaSOT, UAV123 and OTB-50. Without bells and whistles, our SiamCAR achieves the leading performance with a considerable real-time speed.

Dongyan Guo, Jun Wang, Ying Cui, Zhenhua Wang, Shengyong Chen• 2019

Related benchmarks

Task	Dataset	Result
Visual Object Tracking	LaSOT (test)	AUC50.7	470
Visual Object Tracking	GOT-10k (test)	Average Overlap56.9	450
Visual Object Tracking	UAV123	AUC0.614	193
Visual Object Tracking	UAV123 (test)	AUC61.4	188
Visual Object Tracking	OTB 2015 (test)	AUC Score69.7	47
Visual Object Tracking	LaSOT 1.0 (test)	AUC50.7	42
Object Tracking	SatSOT	--	18
Anti-UAV Tracking	Anti-UAV318 (test)	AUC48.1	17
Object Tracking	SV248S	S Score41.4	17
Anti-UAV Tracking	DUT Anti-UAV (test)	AUC0.526	17

Showing 10 of 14 rows

Other info

Follow for update

@wizwand_team Discord