Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Bridging the Gap between Classification and Localization for Weakly Supervised Object Localization

About

Weakly supervised object localization aims to find a target object region in a given image with only weak supervision, such as image-level labels. Most existing methods use a class activation map (CAM) to generate a localization map; however, a CAM identifies only the most discriminative parts of a target object rather than the entire object region. In this work, we find the gap between classification and localization in terms of the misalignment of the directions between an input feature and a class-specific weight. We demonstrate that the misalignment suppresses the activation of CAM in areas that are less discriminative but belong to the target object. To bridge the gap, we propose a method to align feature directions with a class-specific weight. The proposed method achieves a state-of-the-art localization performance on the CUB-200-2011 and ImageNet-1K benchmarks.

Eunji Kim, Siwon Kim, Jungbeom Lee, Hyunwoo Kim, Sungroh Yoon• 2022

Related benchmarks

TaskDatasetResultRank
Object LocalizationImageNet-1k (val)
Top-1 Loc Acc53.76
80
Weakly Supervised Object LocalizationCUB (test)
Top-1 Loc Acc53.8
80
Object LocalizationCUB-200-2011 (test)
Top-1 Loc. Accuracy73.16
68
Weakly Supervised Object LocalizationCUB-200 (test)
Top-1 Loc Acc70.8
26
Weakly Supervised Object LocalizationImageNet 1k (test)
MaxBoxAccV2 (Mean)68.7
20
Object LocalizationCUB v2
Max Box Acc V280.1
20
Weakly Supervised Object LocalizationILSVRC (test)
Top-1 Loc Acc53.8
14
Weakly Supervised Object LocalizationCUB-200-2011 v2
MaxBoxAccV280.1
10
Object LocalizationOpenImages30K (test)
PxAP63.7
7
Showing 9 of 9 rows

Other info

Follow for update