Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Active Learning on a Budget: Opposite Strategies Suit High and Low Budgets

About

Investigating active learning, we focus on the relation between the number of labeled examples (budget size), and suitable querying strategies. Our theoretical analysis shows a behavior reminiscent of phase transition: typical examples are best queried when the budget is low, while unrepresentative examples are best queried when the budget is large. Combined evidence shows that a similar phenomenon occurs in common classification models. Accordingly, we propose TypiClust -- a deep active learning strategy suited for low budgets. In a comparative empirical investigation of supervised learning, using a variety of architectures and image datasets, TypiClust outperforms all other active learning strategies in the low-budget regime. Using TypiClust in the semi-supervised framework, performance gets an even more significant boost. In particular, state-of-the-art semi-supervised methods trained on CIFAR-10 with 10 labeled examples selected by TypiClust, reach 93.2% accuracy -- an improvement of 39.4% over random selection. Code is available at https://github.com/avihu111/TypiClust.

Guy Hacohen, Avihu Dekel, Daphna Weinshall• 2022

Related benchmarks

TaskDatasetResultRank
Visual Question AnsweringVQA v2
Accuracy65.8
1165
Object Hallucination EvaluationPOPE
Accuracy81.6
935
Multimodal EvaluationMME--
557
Text-based Visual Question AnsweringTextVQA
Accuracy53.3
496
Multimodal ReasoningMM-Vet
MM-Vet Score29.7
281
Multimodal EvaluationMM-Vet--
122
Multimodal EvaluationMMBench
MMB Score32.4
118
Image ClassificationCIFAR-100
Accuracy60.4
109
Science Question AnsweringScienceQA SQA-I
Accuracy59.2
81
Multimodal EvaluationSEED-Bench
Accuracy44.9
80
Showing 10 of 16 rows

Other info

Follow for update