Seeking Similarities over Differences: Similarity-based Domain Alignment for Adaptive Object Detection

About

To be deployed robustly across a wide range of scenarios, object detectors should adapt to shifts in the input distribution without the need to constantly annotate new data. This has motivated research in Unsupervised Domain Adaptation (UDA) algorithms for detection. UDA methods learn to adapt from labeled source domains to unlabeled target domains by inducing alignment between detector features from the source and target domains. Yet there is no consensus on which features to align or how to align them. In our work, we propose a framework that generalizes the different components commonly used by UDA methods, laying the ground for an in-depth analysis of the UDA design space. Specifically, we propose a novel UDA algorithm, ViSGA, a direct implementation of our framework, that leverages the best design choices and introduces a simple but effective method to aggregate features at the instance level based on visual similarity before inducing group alignment via adversarial training. We show that similarity-based grouping and adversarial training together allow our model to focus on coarsely aligning feature groups, without being forced to match all instances across loosely aligned domains. Finally, we examine the applicability of ViSGA to the setting where labeled data are gathered from different sources. Experiments show that our method not only outperforms previous single-source approaches on Sim2Real and Adverse Weather, but also generalizes well to the multi-source setting.

Farzaneh Rezaeianaran, Rakshith Shetty, Rahaf Aljundi, Daniel Olmeda Reino, Shanshan Zhang, Bernt Schiele • 2021
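
The abstract describes aggregating instance-level features by visual similarity before aligning the resulting groups. The following is a minimal sketch of that grouping idea only, not the authors' implementation: the greedy assignment rule, the cosine-similarity threshold of 0.9, and the names `cosine_similarity` and `group_by_similarity` are all assumptions made for illustration. In a full pipeline, the per-group mean features would presumably be passed to a domain discriminator for adversarial alignment, which is not shown here.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # treat zero vectors as dissimilar to everything
    return dot / (norm_a * norm_b)


def group_by_similarity(features, threshold=0.9):
    """Greedily group instance features by similarity (hypothetical rule).

    Each feature joins the first existing group whose running mean has
    cosine similarity >= threshold; otherwise it starts a new group.
    Returns (groups, means): lists of member indices and per-group means.
    """
    groups = []  # groups[g] holds indices of the features in group g
    means = []   # means[g] is the running mean feature of group g
    for idx, feat in enumerate(features):
        for g, mean in enumerate(means):
            if cosine_similarity(feat, mean) >= threshold:
                groups[g].append(idx)
                n = len(groups[g])
                # Incrementally update the group mean with the new member.
                means[g] = [(m * (n - 1) + f) / n for m, f in zip(mean, feat)]
                break
        else:
            # No sufficiently similar group was found: open a new one.
            groups.append([idx])
            means.append(list(feat))
    return groups, means


if __name__ == "__main__":
    # Toy 2-D "instance features": two roughly horizontal, two roughly vertical.
    toy = [[1.0, 0.0], [0.99, 0.05], [0.0, 1.0], [0.02, 1.0]]
    toy_groups, toy_means = group_by_similarity(toy, threshold=0.9)
    print(toy_groups)
```

Aligning a handful of group means rather than every instance reflects the abstract's point that the model is not forced to match all instances across loosely aligned domains; the threshold controls how coarse the groups are.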

Related benchmarks

Task	Dataset	Result
Object Detection	Cityscapes to Foggy Cityscapes (test)	mAP 43.3	196
Object Detection	Foggy Cityscapes (test)	AP (Person) 38.8	161
Object Detection	Sim10K → Cityscapes (test)	AP (Car) 49.3	104
Object Detection	Cityscapes Adaptation from SIM-10k (val)	AP (Car) 49.3	97
Object Detection	Cityscapes to Foggy Cityscapes (val)	mAP 43.3	57
Object Detection	Sim10k to Cityscapes	AP (Car) 49.3	51
Object Detection	Cityscapes adaptation from KITTI (val)	mAP 47.6	46
Object Detection	Cityscapes S -> C adaptation (val)	mAP 49.3	37
Object Detection	Foggy Cityscapes to Cityscapes (test)	AP (person) 38.8	21
Object Detection	INBreast (test)	AP (car) 49.3	11

Showing 10 of 13 rows
