Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Learning Deep Features for Discriminative Localization

About

In this work, we revisit the global average pooling layer proposed in [13], and shed light on how it explicitly enables the convolutional neural network to have remarkable localization ability despite being trained on image-level labels. While this technique was previously proposed as a means for regularizing training, we find that it actually builds a generic localizable deep representation that can be applied to a variety of tasks. Despite the apparent simplicity of global average pooling, we are able to achieve 37.1% top-5 error for object localization on ILSVRC 2014, which is remarkably close to the 34.2% top-5 error achieved by a fully supervised CNN approach. We demonstrate that our network is able to localize the discriminative image regions on a variety of tasks despite not being trained for them

Bolei Zhou, Aditya Khosla, Agata Lapedriza, Aude Oliva, Antonio Torralba• 2015

Related benchmarks

TaskDatasetResultRank
Semantic segmentationCityscapes (test)
mIoU32.2
1252
Semantic segmentationCamVid (test)
mIoU6.6
411
Image ClassificationCUB-200-2011 (test)--
303
Semantic segmentationCityscapes (val)
mIoU33
301
Instance SegmentationPASCAL VOC 2012 (val)
mAP @0.57.8
173
Visual Question AnsweringVQA (test-dev)
Acc (All)58.91
147
Semantic segmentationPASCAL VOC 2012 (train)
mIoU58.1
88
Weakly Supervised Object LocalizationCUB (test)
Top-1 Loc Acc56.1
80
Object LocalizationImageNet-1k (val)
Top-1 Loc Acc46.3
80
Weakly Supervised Object LocalizationCUB
MaxBoxAccV263.7
69
Showing 10 of 69 rows

Other info

Code

Follow for update