
Global Context Networks

About

The Non-Local Network (NLNet) presents a pioneering approach for capturing long-range dependencies within an image, via aggregating query-specific global context to each query position. However, through a rigorous empirical analysis, we have found that the global contexts modeled by the non-local network are almost the same for different query positions. In this paper, we take advantage of this finding to create a simplified network based on a query-independent formulation, which maintains the accuracy of NLNet but with significantly less computation. We further replace the one-layer transformation function of the non-local block by a two-layer bottleneck, which further reduces the parameter number considerably. The resulting network element, called the global context (GC) block, effectively models global context in a lightweight manner, allowing it to be applied at multiple layers of a backbone network to form a global context network (GCNet). Experiments show that GCNet generally outperforms NLNet on major benchmarks for various recognition tasks. The code and network configurations are available at https://github.com/xvjiarui/GCNet.
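The query-independent formulation described above can be sketched in a few lines of NumPy. This is a minimal illustration, not the authors' implementation (which is at the repository linked above): the function name `gc_block` and the weight names `w_k`, `w_v1`, `w_v2` are hypothetical, biases and the learnable normalization scale and shift are omitted, and the layer normalization and ReLU between the two bottleneck layers follow the published paper rather than anything stated on this page.

```python
import numpy as np

def softmax(z):
    # Numerically stable softmax over a 1-D array.
    e = np.exp(z - z.max())
    return e / e.sum()

def gc_block(x, w_k, w_v1, w_v2, eps=1e-5):
    """Sketch of a global context block for one feature map.

    x:    (C, H, W) input features
    w_k:  (C,)      weights of the 1x1 conv producing attention logits (assumed name)
    w_v1: (C//r, C) first bottleneck layer (assumed name)
    w_v2: (C, C//r) second bottleneck layer (assumed name)
    """
    c, h, w = x.shape
    flat = x.reshape(c, h * w)              # (C, N) with N = H*W positions

    # Context modeling: one attention map shared by all query positions.
    attn = softmax(w_k @ flat)              # (N,), sums to 1
    context = flat @ attn                   # (C,) attention-weighted global feature

    # Two-layer bottleneck transform (normalization + ReLU in between).
    hidden = w_v1 @ context                 # (C//r,)
    hidden = (hidden - hidden.mean()) / np.sqrt(hidden.var() + eps)
    hidden = np.maximum(hidden, 0.0)
    delta = w_v2 @ hidden                   # (C,)

    # Fusion: add the same context vector to every spatial position.
    return x + delta[:, None, None]

# Usage with random weights: C = 8 channels, bottleneck ratio r = 4.
rng = np.random.default_rng(0)
x = rng.standard_normal((8, 4, 5))
out = gc_block(x,
               rng.standard_normal(8),
               rng.standard_normal((2, 8)),
               rng.standard_normal((8, 2)))
print(out.shape)
```

Because the attention map does not depend on the query position, the context vector is computed once per feature map and broadcast to all positions. For C channels and bottleneck ratio r, the two layers hold 2·C²/r weights, compared with C² for a single C×C transform.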

Yue Cao, Jiarui Xu, Stephen Lin, Fangyun Wei, Han Hu • 2020

Related benchmarks

Task                    Dataset                    Result               Rank
Object Detection        COCO (test-dev)            mAP 52.3             1239
Image Classification    ImageNet (val)             Top-1 Acc 77.49      1206
Object Detection        COCO v2017 (test-dev)      mAP 52.3             499
Instance Segmentation   COCO (test-dev)            --                   380
Semantic segmentation   Cityscapes (val)           mIoU 78.95           374
Instance Segmentation   COCO 2017 (test-dev)       AP (Overall) 45.4    253
Action Recognition      Kinetics-400 full (val)    Top-1 Acc 76         141
Object Detection        COCO mini (val)            AP 51.8              132
Instance Segmentation   COCO mini (val)            AP^m 44.7            72
Semantic segmentation   COCO-EA                    mR 92                26
Showing 10 of 14 rows
