I3Net: Implicit Instance-Invariant Network for Adapting One-Stage Object Detectors
About
Recent works on two-stage cross-domain detection have widely explored the local feature patterns to achieve more accurate adaptation results. These methods heavily rely on the region proposal mechanisms and ROI-based instance-level features to design fine-grained feature alignment modules with respect to the foreground objects. However, for one-stage detectors, it is hard or even impossible to obtain explicit instance-level features in the detection pipelines. Motivated by this, we propose an Implicit Instance-Invariant Network (I3Net), which is tailored for adapting one-stage detectors and implicitly learns instance-invariant features via exploiting the natural characteristics of deep features in different layers. Specifically, we facilitate the adaptation from three aspects: (1) Dynamic and Class-Balanced Reweighting (DCBR) strategy, which considers the coexistence of intra-domain and intra-class variations to assign larger weights to those sample-scarce categories and easy-to-adapt samples; (2) Category-aware Object Pattern Matching (COPM) module, which boosts the cross-domain foreground objects matching guided by the categorical information and suppresses the uninformative background features; (3) Regularized Joint Category Alignment (RJCA) module, which jointly enforces the category alignment at different domain-specific layers with a consistency regularization. Experiments reveal that I3Net exceeds the state-of-the-art performance on benchmark datasets.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Object Detection | Watercolor2k (test) | mAP (Overall)51.5 | 113 | |
| Object Detection | Clipart1k (test) | mAP37.8 | 70 | |
| Object Detection | PASCAL VOC to Water Color (test) | mAP51.5 | 64 | |
| Object Detection | Comic2k (test) | mAP30.1 | 62 | |
| Object Detection | VOC to Watercolor (target) | mAP51.5 | 31 | |
| Object Detection | VOC to Comic (test) | mAP30.1 | 20 | |
| Object Detection | PASCAL VOC → Clipart1k (test) | AP (aero)30 | 20 | |
| Object Detection | Watercolor (test) | Bike Prediction Error81.1 | 17 | |
| Object Detection | Comic V→Co (test) | AP (bicycle)47.5 | 13 | |
| Object Detection | Comic (test) | Bike Error (Eperf)47.5 | 12 |