Adaptive Object Detection with Dual Multi-Label Prediction
About
In this paper, we propose a novel end-to-end unsupervised deep domain adaptation model for adaptive object detection by exploiting multi-label object recognition as a dual auxiliary task. The model exploits multi-label prediction to reveal the object category information in each image and then uses the prediction results to perform conditional adversarial global feature alignment, such that the multi-modal structure of image features can be tackled to bridge the domain divergence at the global feature level while preserving the discriminability of the features. Moreover, we introduce a prediction consistency regularization mechanism to assist object detection, which uses the multi-label prediction results as an auxiliary regularization information to ensure consistent object category discoveries between the object recognition task and the object detection task. Experiments are conducted on a few benchmark datasets and the results show the proposed model outperforms the state-of-the-art comparison methods.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Object Detection | Cityscapes to Foggy Cityscapes (test) | mAP38.8 | 196 | |
| Object Detection | Watercolor2k (test) | mAP (Overall)56 | 113 | |
| Object Detection | Foggy Cityscapes (test) | mAP (Mean Average Precision)38.8 | 108 | |
| Object Detection | Foggy Cityscapes (val) | mAP38.8 | 67 | |
| Object Detection | PASCAL VOC to Water Color (test) | mAP56 | 64 | |
| Object Detection | Comic2k (test) | mAP33.5 | 62 | |
| Object Detection | VOC to Watercolor (target) | mAP56 | 31 | |
| Object Detection | VOC to Comic (test) | mAP33.5 | 20 | |
| Object Detection | Watercolor (test) | Bike Prediction Error87.9 | 17 | |
| Object Detection | Comic V→Co (test) | AP (bicycle)47.9 | 13 |