Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

On Good Practices for Task-Specific Distillation of Large Pretrained Visual Models

About

Large pretrained visual models exhibit remarkable generalization across diverse recognition tasks. Yet, real-world applications often demand compact models tailored to specific problems. Variants of knowledge distillation have been devised for such a purpose, enabling task-specific compact models (the students) to learn from a generic large pretrained one (the teacher). In this paper, we show that the excellent robustness and versatility of recent pretrained models challenge common practices established in the literature, calling for a new set of optimal guidelines for task-specific distillation. To address the lack of samples in downstream tasks, we also show that a variant of Mixup based on stable diffusion complements standard data augmentation. This strategy eliminates the need for engineered text prompts and improves distillation of generic models into streamlined specialized networks.

Juliette Marrie, Michael Arbel, Julien Mairal, Diane Larlus• 2024

Related benchmarks

TaskDatasetResultRank
Semantic segmentationCityscapes
mIoU74
668
Fine grained classificationAircraft
Top-1 Acc91.02
72
Fine-grained Image ClassificationCUB
Top-1 Acc90.61
45
Fine grained classificationDTD
Accuracy83.49
38
Image ClassificationOxford Pets VTAB natural (test)
Accuracy95.94
9
Image ClassificationCaltech101 VTAB natural (test)
Accuracy98.21
9
Showing 6 of 6 rows

Other info

Follow for update