Real-Time MDNet

About

We present a fast and accurate visual tracking algorithm based on the multi-domain convolutional neural network (MDNet). The proposed approach accelerates feature extraction procedure and learns more discriminative models for instance classification; it enhances representation quality of target and background by maintaining a high resolution feature map with a large receptive field per activation. We also introduce a novel loss term to differentiate foreground instances across multiple domains and learn a more discriminative embedding of target objects with similar semantics. The proposed techniques are integrated into the pipeline of a well known CNN-based visual tracking algorithm, MDNet. We accomplish approximately 25 times speed-up with almost identical accuracy compared to MDNet. Our algorithm is evaluated in multiple popular tracking benchmark datasets including OTB2015, UAV123, and TempleColor, and outperforms the state-of-the-art real-time tracking methods consistently even without dataset-specific parameter tuning.

Ilchae Jung, Jeany Son, Mooyeol Baek, Bohyung Han• 2018

Related benchmarks

Task	Dataset	Result
Visual Object Tracking	TrackingNet (test)	Normalized Precision (Pnorm)69.4	502
Visual Object Tracking	LaSOT (test)	AUC39.7	470
Visual Object Tracking	UAV123	AUC0.528	193
Visual Object Tracking	UAV123 (test)	AUC52.8	188
Visual Object Tracking	NfS	AUC0.433	128
RGBT Tracking	RGBT-210	Precision Rate71.5	66
Visual Object Tracking	OTB 2015	AUC65	63
RGBT Tracking	RGBT 234	Precision Rate75.8	53
Visual Tracking	NfS (test)	AUC43.3	45
RGBT Tracking	VOT-RGBT 2019	EAO21.36	40

Showing 10 of 23 rows

Other info

Follow for update

@wizwand_team Discord