Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

LaFTer: Label-Free Tuning of Zero-shot Classifier using Language and Unlabeled Image Collections

About

Recently, large-scale pre-trained Vision and Language (VL) models have set a new state-of-the-art (SOTA) in zero-shot visual classification enabling open-vocabulary recognition of potentially unlimited set of categories defined as simple language prompts. However, despite these great advances, the performance of these zeroshot classifiers still falls short of the results of dedicated (closed category set) classifiers trained with supervised fine tuning. In this paper we show, for the first time, how to reduce this gap without any labels and without any paired VL data, using an unlabeled image collection and a set of texts auto-generated using a Large Language Model (LLM) describing the categories of interest and effectively substituting labeled visual instances of those categories. Using our label-free approach, we are able to attain significant performance improvements over the zero-shot performance of the base VL model and other contemporary methods and baselines on a wide variety of datasets, demonstrating absolute improvement of up to 11.7% (3.8% on average) in the label-free setting. Moreover, despite our approach being label-free, we observe 1.3% average gains over leading few-shot prompting baselines that do use 5-shot supervision.

M. Jehanzeb Mirza, Leonid Karlinsky, Wei Lin, Mateusz Kozinski, Horst Possegger, Rogerio Feris, Horst Bischof• 2023

Related benchmarks

TaskDatasetResultRank
Image ClassificationCIFAR-100
Top-1 Accuracy74.6
622
Image ClassificationImageNet A
Top-1 Acc31.5
553
Image ClassificationCIFAR-10--
507
Image ClassificationEuroSAT--
497
Image ClassificationDTD--
487
Image ClassificationFlowers102
Accuracy65.25
478
Image ClassificationImageNet-R
Top-1 Acc72.6
474
Image ClassificationSUN397--
425
Image ClassificationDTD
Accuracy51.77
419
Image ClassificationUCF101
Top-1 Acc69.47
404
Showing 10 of 25 rows

Other info

Code

Follow for update