Long-tail learning via logit adjustment
About
Real-world classification problems typically exhibit an imbalanced or long-tailed label distribution, wherein many labels are associated with only a few samples. This poses a challenge for generalisation on such labels, and also makes naïve learning biased towards dominant labels. In this paper, we present two simple modifications of standard softmax cross-entropy training to cope with these challenges. Our techniques revisit the classic idea of logit adjustment based on the label frequencies, either applied post-hoc to a trained model, or enforced in the loss during training. Such adjustment encourages a large relative margin between logits of rare versus dominant labels. These techniques unify and generalise several recent proposals in the literature, while possessing firmer statistical grounding and empirical performance.
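The two adjustments described above can be sketched as follows. This is a minimal NumPy illustration, not the authors' released code: it assumes estimated class priors (label frequencies) `class_priors` and a scaling parameter `tau`, and shows (a) post-hoc correction of a trained model's logits and (b) a logit-adjusted softmax cross-entropy loss for training.

```python
import numpy as np

def posthoc_logit_adjustment(logits, class_priors, tau=1.0):
    """Post-hoc variant: subtract tau * log(prior) from each class logit
    at inference time, shifting predictions towards rare classes."""
    return logits - tau * np.log(class_priors)

def logit_adjusted_cross_entropy(logits, label, class_priors, tau=1.0):
    """Training-time variant: add tau * log(prior) to the logits before
    the softmax cross-entropy, which encourages a larger relative margin
    between rare and dominant labels."""
    adjusted = logits + tau * np.log(class_priors)
    # Numerically stable log-softmax.
    z = adjusted - adjusted.max()
    log_probs = z - np.log(np.exp(z).sum())
    return -log_probs[label]

# Example: two classes with a 9:1 label imbalance.
priors = np.array([0.9, 0.1])
raw = np.array([1.0, 0.9])          # biased model slightly favours class 0
adjusted = posthoc_logit_adjustment(raw, priors)
# After adjustment the rare class (index 1) wins the argmax.
```

With `tau = 1.0` the post-hoc rule recovers the Bayes-optimal correction for the balanced error under the usual label-shift assumption; `tau` can also be tuned as a hyperparameter.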
Related benchmarks
| Task | Dataset | Result | Rank |
|---|---|---|---|
| Image Classification | iNaturalist 2018 | Top-1 Accuracy: 68.4 | 287 |
| Image Classification | ImageNet-LT | Top-1 Accuracy: 56.5 | 251 |
| Long-Tailed Image Classification | ImageNet-LT (test) | -- | 220 |
| Image Classification | CIFAR-10 long-tailed (test) | Top-1 Accuracy: 75.3 | 201 |
| Image Classification | iNaturalist 2018 (test) | Top-1 Accuracy: 66.4 | 192 |
| Text Classification | SST-2 (test) | Accuracy: 86.61 | 185 |
| Image Classification | ImageNet-LT (test) | Top-1 Accuracy (All): 51.1 | 159 |
| Image Classification | CIFAR-100 long-tailed (test) | Accuracy: 58.6 | 155 |
| Classification | CIFAR-100-LT (test) | Accuracy: 62.4 | 136 |
| Long-tailed Visual Recognition | ImageNet-LT | Overall Accuracy: 52.1 | 89 |