Relation Networks for Object Detection

About

Although it is well believed for years that modeling relations between objects would help object recognition, there has not been evidence that the idea is working in the deep learning era. All state-of-the-art object detection systems still rely on recognizing object instances individually, without exploiting their relations during learning. This work proposes an object relation module. It processes a set of objects simultaneously through interaction between their appearance feature and geometry, thus allowing modeling of their relations. It is lightweight and in-place. It does not require additional supervision and is easy to embed in existing networks. It is shown effective on improving object recognition and duplicate removal steps in the modern object detection pipeline. It verifies the efficacy of modeling object relations in CNN based detection. It gives rise to the first fully end-to-end object detector.

Han Hu, Jiayuan Gu, Zheng Zhang, Jifeng Dai, Yichen Wei• 2017

Related benchmarks

Task	Dataset	Result
Object Detection	COCO (test-dev)	mAP39	1239
Scene Graph Generation	Visual Genome (test)	R@500.275	86
Scene Graph Generation	Open Images v6 (test)	wmAPrel34.2	74
Scene Graph Classification	Visual Genome (test)	--	63
Pedestrian Detection	CrowdHuman (val)	MR^-248.2	61
Predicate Classification	Visual Genome (test)	R@5036	50
Pedestrian Detection	CrowdHuman (test)	MR48.2	16
Lesion Detection	CVA-BUS high-quality labels re-annotated version	Pr@8091.4	16
Scene Graph Detection (SGDet)	Visual Genome (test)	AP5026.4	3

Showing 9 of 9 rows

Other info

Follow for update

@wizwand_team Discord