Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Counterfactual Attention Learning for Fine-Grained Visual Categorization and Re-identification

About

Attention mechanism has demonstrated great potential in fine-grained visual recognition tasks. In this paper, we present a counterfactual attention learning method to learn more effective attention based on causal inference. Unlike most existing methods that learn visual attention based on conventional likelihood, we propose to learn the attention with counterfactual causality, which provides a tool to measure the attention quality and a powerful supervisory signal to guide the learning process. Specifically, we analyze the effect of the learned visual attention on network prediction through counterfactual intervention and maximize the effect to encourage the network to learn more useful attention for fine-grained image recognition. Empirically, we evaluate our method on a wide range of fine-grained recognition tasks where attention plays a crucial role, including fine-grained image categorization, person re-identification, and vehicle re-identification. The consistent improvement on all benchmarks demonstrates the effectiveness of our method. Code is available at https://github.com/raoyongming/CAL

Yongming Rao, Guangyi Chen, Jiwen Lu, Jie Zhou• 2021

Related benchmarks

TaskDatasetResultRank
Person Re-IdentificationMarket1501 (test)
Rank-1 Accuracy95.5
1264
Person Re-IdentificationMarket 1501
mAP87
1136
Person Re-IdentificationDuke MTMC-reID (test)
Rank-190
1023
Person Re-IdentificationDukeMTMC-reID
Rank-1 Acc87.2
667
Fine-grained Image ClassificationCUB200 2011 (test)
Accuracy90.6
567
Person Re-IdentificationMSMT17
mAP0.64
546
Person Re-IdentificationMSMT17 (test)
Rank-1 Acc84.2
517
Person Re-IdentificationMarket-1501 (test)
Rank-194.5
417
Fine-grained Image ClassificationStanford Cars (test)
Accuracy95.5
372
Fine-grained visual classificationFGVC-Aircraft (test)
Top-1 Acc94.2
312
Showing 10 of 27 rows

Other info

Code

Follow for update