Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Seeking Similarities over Differences: Similarity-based Domain Alignment for Adaptive Object Detection

About

In order to robustly deploy object detectors across a wide range of scenarios, they should be adaptable to shifts in the input distribution without the need to constantly annotate new data. This has motivated research in Unsupervised Domain Adaptation (UDA) algorithms for detection. UDA methods learn to adapt from labeled source domains to unlabeled target domains, by inducing alignment between detector features from source and target domains. Yet, there is no consensus on what features to align and how to do the alignment. In our work, we propose a framework that generalizes the different components commonly used by UDA methods laying the ground for an in-depth analysis of the UDA design space. Specifically, we propose a novel UDA algorithm, ViSGA, a direct implementation of our framework, that leverages the best design choices and introduces a simple but effective method to aggregate features at instance-level based on visual similarity before inducing group alignment via adversarial training. We show that both similarity-based grouping and adversarial training allows our model to focus on coarsely aligning feature groups, without being forced to match all instances across loosely aligned domains. Finally, we examine the applicability of ViSGA to the setting where labeled data are gathered from different sources. Experiments show that not only our method outperforms previous single-source approaches on Sim2Real and Adverse Weather, but also generalizes well to the multi-source setting.

Farzaneh Rezaeianaran, Rakshith Shetty, Rahaf Aljundi, Daniel Olmeda Reino, Shanshan Zhang, Bernt Schiele• 2021

Related benchmarks

TaskDatasetResultRank
Object DetectionCityscapes to Foggy Cityscapes (test)
mAP43.3
196
Object DetectionFoggy Cityscapes (test)
mAP (Mean Average Precision)43.3
108
Object DetectionSim10K → Cityscapes (test)
AP (Car)49.3
104
Object DetectionCityscapes Adaptation from SIM-10k (val)
AP (Car)49.3
97
Object DetectionCityscapes to Foggy Cityscapes (val)
mAP43.3
57
Object DetectionSim10k to Cityscapes
AP (Car)49.3
51
Object DetectionCityscapes adaptation from KITTI (val)
mAP47.6
46
Object DetectionCityscapes S -> C adaptation (val)
mAP49.3
37
Object DetectionFoggy Cityscapes to Cityscapes (test)
AP (person)38.8
21
Object DetectionINBreast (test)
AP (car)49.3
11
Showing 10 of 12 rows

Other info

Follow for update