SoLar: Sinkhorn Label Refinery for Imbalanced Partial-Label Learning
About
Partial-label learning (PLL) is a peculiar weakly-supervised learning task where the training samples are generally associated with a set of candidate labels instead of single ground truth. While a variety of label disambiguation methods have been proposed in this domain, they normally assume a class-balanced scenario that may not hold in many real-world applications. Empirically, we observe degenerated performance of the prior methods when facing the combinatorial challenge from the long-tailed distribution and partial-labeling. In this work, we first identify the major reasons that the prior work failed. We subsequently propose SoLar, a novel Optimal Transport-based framework that allows to refine the disambiguated labels towards matching the marginal class prior distribution. SoLar additionally incorporates a new and systematic mechanism for estimating the long-tailed class prior distribution under the PLL setup. Through extensive experiments, SoLar exhibits substantially superior results on standardized benchmarks compared to the previous state-of-the-art PLL methods. Code and data are available at: https://github.com/hbzju/SoLar .
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Partial-Label Learning | CIFAR10-LT | Accuracy83.8 | 48 | |
| Partial-Label Learning | CIFAR100 LT | Accuracy64.75 | 48 | |
| Partial-Label Learning | BirdSong Uniform (test) | Accuracy0.7205 | 6 | |
| Partial-Label Learning | Soccer Player Uniform (test) | Accuracy57.94 | 6 | |
| Partial-Label Learning | Lost Balanced (test) | Accuracy70.56 | 6 | |
| Partial-Label Learning | BirdSong Balanced (test) | Accuracy68.72 | 6 | |
| Partial-Label Learning | Soccer Player Balanced (test) | Accuracy24.97 | 6 | |
| Partial-Label Learning | Yahoo!News Balanced (test) | Accuracy58.18 | 6 | |
| Partial-Label Learning | Lost Uniform (test) | Accuracy77.86 | 6 | |
| Partial-Label Learning | Yahoo!News Uniform (test) | Accuracy67.62 | 6 |