Compact Generalized Non-local Network

About

The non-local module is designed for capturing long-range spatio-temporal dependencies in images and videos. Although it has shown excellent performance, it lacks a mechanism to model the interactions between positions across channels, which are of vital importance in recognizing fine-grained objects and actions. To address this limitation, we generalize the non-local module and take the correlations between the positions of any two channels into account. This extension uses a compact representation of multiple kernel functions, obtained through Taylor expansion, which gives the generalized non-local module a fast, low-complexity computation flow. Moreover, we implement our generalized non-local method within channel groups to ease the optimization. Experimental results show clear-cut improvements and the practical applicability of the generalized non-local module on both fine-grained object recognition and video classification. Code is available at: https://github.com/KaiyuYue/cgnl-network.pytorch.
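To make the "compact representation with Taylor expansion" idea concrete, here is a minimal sketch in plain Python. It is not the authors' implementation: it assumes the features theta, phi and g have already been flattened over all positions and channels into lists of scalars, and it uses an exponential kernel exp(theta_i * phi_j) purely for illustration, since its Taylor series is simple. The function names and the `order` parameter are hypothetical. The point it shows is that truncating the series lets the output be computed without ever forming the full pairwise kernel matrix.

```python
import math

def dense_nonlocal(theta, phi, g):
    """Reference version: evaluates the kernel for every (i, j) pair.

    Cost grows quadratically with the number of flattened entries.
    """
    return [sum(math.exp(t * p) * v for p, v in zip(phi, g)) for t in theta]

def compact_nonlocal(theta, phi, g, order=8):
    """Taylor-truncated version (illustrative assumption, not the paper's code).

    Uses exp(t * p) ~= sum_{k=0..order} (t * p)**k / k!, so that
    y_i = sum_k theta_i**k * m_k, with m_k = sum_j phi_j**k * g_j / k!.
    Cost grows linearly with the number of flattened entries.
    """
    # One pass over phi and g per Taylor term; no pairwise matrix is built.
    moments = [
        sum((p ** k) * v for p, v in zip(phi, g)) / math.factorial(k)
        for k in range(order + 1)
    ]
    return [sum((t ** k) * m for k, m in enumerate(moments)) for t in theta]

# Toy input: 3 positions x 2 channels, flattened to 6 scalars each.
theta = [0.2, -0.5, 0.1, 0.7, -0.3, 0.4]
phi = [0.6, 0.1, -0.4, 0.3, 0.5, -0.2]
g = [1.0, 2.0, -1.0, 0.5, 0.0, 1.5]

dense = dense_nonlocal(theta, phi, g)
compact = compact_nonlocal(theta, phi, g, order=8)
max_gap = max(abs(a - b) for a, b in zip(dense, compact))
```

With inputs of small magnitude, as in this toy example, a handful of Taylor terms is enough for the two versions to agree closely; larger inputs would need a higher `order` or normalization beforehand.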

Kaiyu Yue, Ming Sun, Yuchen Yuan, Feng Zhou, Errui Ding, Fuxin Xu • 2018

Related benchmarks

Task	Dataset	Result
Instance Segmentation	COCO 2017 (val)	--	1275
Object Detection	PASCAL VOC 2007 (test)	--	844
Image Classification	CUB-200-2011 (test)	Top-1 Acc 87	303
Object Detection	MS-COCO (test)	--	81
Object Detection	MS-COCO 2017 (val)	Box AP 35.7	32
Video Classification	Mini-Kinetics-200 (val)	Top-1 Acc 79.9	11
Instance Segmentation	MS-COCO (test)	AP (Mask) 35.4	11
