Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Conformer: Local Features Coupling Global Representations for Visual Recognition

About

Within Convolutional Neural Network (CNN), the convolution operations are good at extracting local features but experience difficulty to capture global representations. Within visual transformer, the cascaded self-attention modules can capture long-distance feature dependencies but unfortunately deteriorate local feature details. In this paper, we propose a hybrid network structure, termed Conformer, to take advantage of convolutional operations and self-attention mechanisms for enhanced representation learning. Conformer roots in the Feature Coupling Unit (FCU), which fuses local features and global representations under different resolutions in an interactive fashion. Conformer adopts a concurrent structure so that local features and global representations are retained to the maximum extent. Experiments show that Conformer, under the comparable parameter complexity, outperforms the visual transformer (DeiT-B) by 2.3% on ImageNet. On MSCOCO, it outperforms ResNet-101 by 3.7% and 3.6% mAPs for object detection and instance segmentation, respectively, demonstrating the great potential to be a general backbone network. Code is available at https://github.com/pengzhiliang/Conformer.

Zhiliang Peng, Wei Huang, Shanzhi Gu, Lingxi Xie, Yaowei Wang, Jianbin Jiao, Qixiang Ye• 2021

Related benchmarks

TaskDatasetResultRank
Instance SegmentationCOCO 2017 (val)--
1275
ClassificationImageNet-1K 1.0 (val)
Top-1 Accuracy (%)84.1
1171
Semantic segmentationADE20K
mIoU22.11
1028
Image ClassificationImageNet-1k (val)
Top-1 Acc84.1
706
Semantic segmentationCOCO Stuff
mIoU26.37
399
Image ClassificationImageNet (val)
Top-1 Accuracy83.4
354
Object DetectionMS-COCO 2017 (val)--
264
Semantic segmentationPascal Context
mIoU40.03
217
Object DetectionMS-COCO--
208
Image ClassificationImageNet (val)
Top-1 Accuracy83.4
188
Showing 10 of 21 rows

Other info

Code

Follow for update