Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

ECA-Net: Efficient Channel Attention for Deep Convolutional Neural Networks

About

Recently, channel attention mechanism has demonstrated to offer great potential in improving the performance of deep convolutional neural networks (CNNs). However, most existing methods dedicate to developing more sophisticated attention modules for achieving better performance, which inevitably increase model complexity. To overcome the paradox of performance and complexity trade-off, this paper proposes an Efficient Channel Attention (ECA) module, which only involves a handful of parameters while bringing clear performance gain. By dissecting the channel attention module in SENet, we empirically show avoiding dimensionality reduction is important for learning channel attention, and appropriate cross-channel interaction can preserve performance while significantly decreasing model complexity. Therefore, we propose a local cross-channel interaction strategy without dimensionality reduction, which can be efficiently implemented via $1D$ convolution. Furthermore, we develop a method to adaptively select kernel size of $1D$ convolution, determining coverage of local cross-channel interaction. The proposed ECA module is efficient yet effective, e.g., the parameters and computations of our modules against backbone of ResNet50 are 80 vs. 24.37M and 4.7e-4 GFLOPs vs. 3.86 GFLOPs, respectively, and the performance boost is more than 2% in terms of Top-1 accuracy. We extensively evaluate our ECA module on image classification, object detection and instance segmentation with backbones of ResNets and MobileNetV2. The experimental results show our module is more efficient while performing favorably against its counterparts.

Qilong Wang, Banggu Wu, Pengfei Zhu, Peihua Li, Wangmeng Zuo, Qinghua Hu• 2019

Related benchmarks

TaskDatasetResultRank
Image ClassificationCIFAR-10 (test)
Accuracy94.68
3381
Object DetectionCOCO 2017 (val)
AP41.3
2643
Image ClassificationImageNet-1k (val)
Top-1 Accuracy78.92
1469
Image ClassificationImageNet (val)--
1206
Instance SegmentationCOCO 2017 (val)
APm0.411
1201
Image ClassificationImageNet
Top-1 Accuracy78.65
431
Image ClassificationILSVRC 2012 (val)--
156
Hyperspectral Semantic SegmentationH-City C: 128 (test)
mIoU51.28
45
Object DetectionDSEC (test)
mAP (Car)36.7
29
Fracture DetectionGRAZPEDWRI-DX 1.0 (test)
F1 Score65
24
Showing 10 of 19 rows

Other info

Code

Follow for update