GCNet: Non-local Networks Meet Squeeze-Excitation Networks and Beyond

About

The Non-Local Network (NLNet) presents a pioneering approach for capturing long-range dependencies, via aggregating query-specific global context to each query position. However, through a rigorous empirical analysis, we have found that the global contexts modeled by non-local network are almost the same for different query positions within an image. In this paper, we take advantage of this finding to create a simplified network based on a query-independent formulation, which maintains the accuracy of NLNet but with significantly less computation. We further observe that this simplified design shares similar structure with Squeeze-Excitation Network (SENet). Hence we unify them into a three-step general framework for global context modeling. Within the general framework, we design a better instantiation, called the global context (GC) block, which is lightweight and can effectively model the global context. The lightweight property allows us to apply it for multiple layers in a backbone network to construct a global context network (GCNet), which generally outperforms both simplified NLNet and SENet on major benchmarks for various recognition tasks. The code and configurations are released at https://github.com/xvjiarui/GCNet.

Yue Cao, Jiarui Xu, Stephen Lin, Fangyun Wei, Han Hu• 2019

Related benchmarks

Task	Dataset	Result
Semantic segmentation	ADE20K (val)	mIoU45.2	3069
Object Detection	COCO 2017 (val)	AP47.9	2843
Image Classification	ImageNet-1k (val)	Top-1 Accuracy77.7	1498
Instance Segmentation	COCO 2017 (val)	--	1275
Semantic segmentation	Cityscapes (test)	--	1252
Object Detection	COCO (test-dev)	mAP52.3	1239
Classification	ImageNet-1K 1.0 (val)	Top-1 Accuracy (%)78.93	1171
Object Detection	COCO (val)	mAP40.5	637
Object Detection	COCO v2017 (test-dev)	mAP48.4	499
Instance Segmentation	COCO (val)	APmk36.4	485

Showing 10 of 21 rows

Other info

Code

Follow for update

@wizwand_team Discord