Simple multi-dataset detection
About
How do we build a general and broad object detection system? We use all labels of all concepts ever annotated. These labels span diverse datasets with potentially inconsistent taxonomies. In this paper, we present a simple method for training a unified detector on multiple large-scale datasets. We use dataset-specific training protocols and losses, but share a common detection architecture with dataset-specific outputs. We show how to automatically integrate these dataset-specific outputs into a common semantic taxonomy. In contrast to prior work, our approach does not require manual taxonomy reconciliation. Experiments show our learned taxonomy outperforms a expert-designed taxonomy in all datasets. Our multi-dataset detector performs as well as dataset-specific models on each training domain, and can generalize to new unseen dataset without fine-tuning on them. Code is available at https://github.com/xingyizhou/UniDet.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Object Detection | COCO 2017 (val) | AP45.5 | 2643 | |
| Instance Segmentation | COCO 2017 (val) | -- | 1201 | |
| Instance Segmentation | Cityscapes (val) | -- | 239 | |
| Object Detection | DroneVehicle (test) | mAP5077.17 | 67 | |
| Instance Segmentation | ScanNet (val) | -- | 62 | |
| Object Detection | Objects365 (val) | mAP33.7 | 60 | |
| Object Detection | CrowdHuman (val) | -- | 52 | |
| Object Detection | SARDet-100K (test) | MAP53.81 | 33 | |
| Object Detection | Cityscapes (val) | mAP500.526 | 31 | |
| Object Detection | DOTA (test) | mAP46.49 | 21 |