Multiple Instance Learning via Iterative Self-Paced Supervised Contrastive Learning
About
Learning representations for individual instances when only bag-level labels are available is a fundamental challenge in multiple instance learning (MIL). Recent works have shown promising results using contrastive self-supervised learning (CSSL), which learns to push apart representations corresponding to two different randomly-selected instances. Unfortunately, in real-world applications such as medical image classification, there is often class imbalance, so randomly-selected instances mostly belong to the same majority class, which precludes CSSL from learning inter-class differences. To address this issue, we propose a novel framework, Iterative Self-paced Supervised Contrastive Learning for MIL Representations (ItS2CLR), which improves the learned representation by exploiting instance-level pseudo labels derived from the bag-level labels. The framework employs a novel self-paced sampling strategy to ensure the accuracy of pseudo labels. We evaluate ItS2CLR on three medical datasets, showing that it improves the quality of instance-level pseudo labels and representations, and outperforms existing MIL methods in terms of both bag and instance level accuracy. Code is available at https://github.com/Kangningthu/ItS2CLR
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Slide-level classification | Camelyon16 | AUC0.9465 | 52 | |
| WSI Classification | CAMELYON16 (test) | Avg Acc90.7 | 28 | |
| Bag-level classification | Camelyon16 | AUC94.25 | 27 | |
| Tumor localization | CAMELYON16 (test) | AUC96.72 | 20 | |
| Bag-level classification | Breast Ultrasound | AUC0.9393 | 5 | |
| Mutation Prediction | TCGA-LUAD | EGFR AUC0.7103 | 5 | |
| WSI Classification | TCGA-LUAD mutation (5-fold cross-validation) | AUC (EGFR)0.7103 | 5 | |
| Instance-level classification | Breast Ultrasound | Instance AUC0.8863 | 3 | |
| Bag-level classification | TCGA-LUAD mutation | EGFR Mutation Accuracy72.3 | 2 |