Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

From Generalized zero-shot learning to long-tail with class descriptors

About

Real-world data is predominantly unbalanced and long-tailed, but deep models struggle to recognize rare classes in the presence of frequent classes. Often, classes can be accompanied by side information like textual descriptions, but it is not fully clear how to use them for learning with unbalanced long-tail data. Such descriptions have been mostly used in (Generalized) Zero-shot learning (ZSL), suggesting that ZSL with class descriptions may also be useful for long-tail distributions. We describe DRAGON, a late-fusion architecture for long-tail learning with class descriptors. It learns to (1) correct the bias towards head classes on a sample-by-sample basis; and (2) fuse information from class-descriptions to improve the tail-class accuracy. We also introduce new benchmarks CUB-LT, SUN-LT, AWA-LT for long-tail learning with class-descriptions, building on existing learning-with-attributes datasets and a version of Imagenet-LT with class descriptors. DRAGON outperforms state-of-the-art models on the new benchmark. It is also a new SoTA on existing benchmarks for GFSL with class descriptors (GFSL-d) and standard (vision-only) long-tailed learning ImageNet-LT, CIFAR-10, 100, and Places365.

Dvir Samuel, Yuval Atzmon, Gal Chechik• 2020

Related benchmarks

TaskDatasetResultRank
Long-Tailed Image ClassificationImageNet-LT (test)--
220
Image ClassificationiNaturalist 2018 (test)
Top-1 Accuracy69.1
192
Image ClassificationPlaces-LT (test)--
128
Long-tailed Visual RecognitionImageNet LT
Overall Accuracy50.1
89
Long-Tailed Image ClassificationiNaturalist 2018
Accuracy69.1
82
Image ClassificationCIFAR-100 Imbalance Ratio LT-50 (test)
Accuracy46.85
62
Image ClassificationCIFAR-100-LT Imbalance Ratio 100 (test)--
62
Image ClassificationCIFAR-100 LT Imbalance Ratio 10 (test)--
59
Long-Tailed Image ClassificationCIFAR10-LT imbalance factor 100 (test)
Top-1 Accuracy79.6
46
Generalized Few-Shot LearningCUB Two-Level distributions (test)
Harmonic Mean Accuracy (Acc_H)69.9
37
Showing 10 of 23 rows

Other info

Code

Follow for update