Domain-Specific Suppression for Adaptive Object Detection
About
Domain adaptation methods face performance degradation in object detection, as the complexity of tasks require more about the transferability of the model. We propose a new perspective on how CNN models gain the transferability, viewing the weights of a model as a series of motion patterns. The directions of weights, and the gradients, can be divided into domain-specific and domain-invariant parts, and the goal of domain adaptation is to concentrate on the domain-invariant direction while eliminating the disturbance from domain-specific one. Current UDA object detection methods view the two directions as a whole while optimizing, which will cause domain-invariant direction mismatch even if the output features are perfectly aligned. In this paper, we propose the domain-specific suppression, an exemplary and generalizable constraint to the original convolution gradients in backpropagation to detach the two parts of directions and suppress the domain-specific one. We further validate our theoretical analysis and methods on several domain adaptive object detection tasks, including weather, camera configuration, and synthetic to real-world adaptation. Our experiment results show significant advance over the state-of-the-art methods in the UDA object detection field, performing a promotion of $10.2\sim12.2\%$ mAP on all these domain adaptation scenarios.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Object Detection | Cityscapes to Foggy Cityscapes (test) | mAP47.8 | 196 | |
| Object Detection | Foggy Cityscapes (test) | mAP (Mean Average Precision)40.9 | 108 | |
| Object Detection | Sim10K → Cityscapes (test) | AP (Car)58.6 | 104 | |
| Object Detection | KITTI → Cityscapes (test) | AP (Car)59.2 | 62 | |
| Object Detection | Cityscapes to Foggy Cityscapes (val) | mAP47.8 | 57 | |
| Object Detection | Cityscapes -> Foggy Cityscapes | mAP40.9 | 55 | |
| Object Detection | Sim10k to Cityscapes | AP (Car)44.5 | 51 | |
| Object Detection | Cityscapes adaptation from KITTI (val) | mAP42.7 | 46 | |
| Object Detection | Sim10k to Cityscapes (S2C) | mAP44.5 | 39 | |
| Object Detection | Cityscapes S -> C adaptation (val) | mAP44.5 | 37 |