Relation Networks for Object Detection
About
Although it is well believed for years that modeling relations between objects would help object recognition, there has not been evidence that the idea is working in the deep learning era. All state-of-the-art object detection systems still rely on recognizing object instances individually, without exploiting their relations during learning. This work proposes an object relation module. It processes a set of objects simultaneously through interaction between their appearance feature and geometry, thus allowing modeling of their relations. It is lightweight and in-place. It does not require additional supervision and is easy to embed in existing networks. It is shown effective on improving object recognition and duplicate removal steps in the modern object detection pipeline. It verifies the efficacy of modeling object relations in CNN based detection. It gives rise to the first fully end-to-end object detector.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Object Detection | COCO (test-dev) | mAP39 | 1195 | |
| Scene Graph Generation | Visual Genome (test) | R@500.275 | 86 | |
| Scene Graph Generation | Open Images v6 (test) | wmAPrel34.2 | 74 | |
| Scene Graph Classification | Visual Genome (test) | -- | 63 | |
| Pedestrian Detection | CrowdHuman (val) | MR^-248.2 | 61 | |
| Predicate Classification | Visual Genome (test) | R@5036 | 50 | |
| Pedestrian Detection | CrowdHuman (test) | MR48.2 | 16 | |
| Lesion Detection | CVA-BUS high-quality labels re-annotated version | Pr@8091.4 | 16 | |
| Scene Graph Detection (SGDet) | Visual Genome (test) | AP5026.4 | 3 |