Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

MIM-Refiner: A Contrastive Learning Boost from Intermediate Pre-Trained Representations

About

We introduce MIM (Masked Image Modeling)-Refiner, a contrastive learning boost for pre-trained MIM models. MIM-Refiner is motivated by the insight that strong representations within MIM models generally reside in intermediate layers. Accordingly, MIM-Refiner leverages multiple contrastive heads that are connected to different intermediate layers. In each head, a modified nearest neighbor objective constructs semantic clusters that capture semantic information which improves performance on downstream tasks, including off-the-shelf and fine-tuning settings. The refinement process is short and simple - yet highly effective. Within a few epochs, we refine the features of MIM models from subpar to state-of-the-art, off-the-shelf features. Refining a ViT-H, pre-trained with data2vec 2.0 on ImageNet-1K, sets a new state-of-the-art in linear probing (84.7%) and low-shot classification among models that are pre-trained on ImageNet-1K. MIM-Refiner efficiently combines the advantages of MIM and ID objectives and compares favorably against previous state-of-the-art SSL models on a variety of benchmarks such as low-shot classification, long-tailed classification, clustering and semantic segmentation.

Benedikt Alkin, Lukas Miklautz, Sepp Hochreiter, Johannes Brandstetter• 2024

Related benchmarks

TaskDatasetResultRank
Semantic segmentationADE20K (val)
mIoU54.4
2888
Image ClassificationImageNet-1K 1.0 (val)
Top-1 Accuracy86.9
1952
Semantic segmentationADE20K
mIoU43.7
1024
Image ClassificationVTAB 1K
Overall Mean Accuracy75.9
258
Image ClassificationImageNet-1k 1.0 (test)
Top-1 Accuracy87.1
229
Image ClassificationiNaturalist 2018 (test)
Top-1 Accuracy84.5
207
Image ClassificationImageNet-1K
Accuracy87.1
193
Image ClassificationiNaturalist 18
Overall Accuracy84.5
125
Image ClassificationiNaturalist 2018 (val)--
116
Image ClassificationVTAB-6
Accuracy89.8
29
Showing 10 of 17 rows

Other info

Code

Follow for update