Fine-Grained Classification with Noisy Labels
About
Learning with noisy labels (LNL) aims to ensure model generalization given a label-corrupted training set. In this work, we investigate a rarely studied scenario of LNL on fine-grained datasets (LNL-FG), which is more practical and challenging as large inter-class ambiguities among fine-grained classes cause more noisy labels. We empirically show that existing methods that work well for LNL fail to achieve satisfying performance for LNL-FG, arising the practical need of effective solutions for LNL-FG. To this end, we propose a novel framework called stochastic noise-tolerated supervised contrastive learning (SNSCL) that confronts label noise by encouraging distinguishable representation. Specifically, we design a noise-tolerated supervised contrastive learning loss that incorporates a weight-aware mechanism for noisy label correction and selectively updating momentum queue lists. By this mechanism, we mitigate the effects of noisy anchors and avoid inserting noisy labels into the momentum-updated queue. Besides, to avoid manually-defined augmentation strategies in contrastive learning, we propose an efficient stochastic module that samples feature embeddings from a generated distribution, which can also enhance the representation ability of deep models. SNSCL is general and compatible with prevailing robust LNL strategies to improve their performance for LNL-FG. Extensive experiments demonstrate the effectiveness of SNSCL.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Image Classification | Clothing1M (test) | Accuracy75.3 | 546 | |
| Fine-grained Image Classification | CUB200 2011 (test) | Accuracy78.92 | 536 | |
| Fine-grained Image Classification | Stanford Cars (test) | Accuracy88.18 | 348 | |
| Fine-grained Image Classification | Stanford Dogs (test) | Accuracy81.9 | 117 | |
| Fine-grained Image Classification | Aircraft (test) | Best Accuracy84.17 | 40 | |
| Fine grained classification | Stanford Dogs 20% symmetric noise (test) | Best Accuracy81.4 | 20 | |
| Fine grained classification | Stanford Dogs 40% symmetric noise (test) | Best Accuracy79.12 | 20 | |
| Image Classification | Clothing-1M r ≈ 39.5% (test) | Test Accuracy75.31 | 10 | |
| Image Classification | Food-101N r ≈ 20% (test) | Accuracy86.4 | 10 | |
| Fine grained classification | Stanford Cars 20% symmetric noise (test) | Accuracy83.24 | 1 |