Imagine by Reasoning: A Reasoning-Based Implicit Semantic Data Augmentation for Long-Tailed Classification

About

Real-world data often follows a long-tailed distribution, which makes the performance of existing classification algorithms degrade heavily. A key issue is that samples in tail categories fail to depict their intra-class diversity. Humans can imagine a sample in new poses, scenes, and view angles with their prior knowledge even if it is the first time to see this category. Inspired by this, we propose a novel reasoning-based implicit semantic data augmentation method to borrow transformation directions from other classes. Since the covariance matrix of each category represents the feature transformation directions, we can sample new directions from similar categories to generate definitely different instances. Specifically, the long-tailed distributed data is first adopted to train a backbone and a classifier. Then, a covariance matrix for each category is estimated, and a knowledge graph is constructed to store the relations of any two categories. Finally, tail samples are adaptively enhanced via propagating information from all the similar categories in the knowledge graph. Experimental results on CIFAR-100-LT, ImageNet-LT, and iNaturalist 2018 have demonstrated the effectiveness of our proposed method compared with the state-of-the-art methods.

Xiaohua Chen, Yucan Zhou, Dayan Wu, Wanqian Zhang, Yu Zhou, Bo Li, Weiping Wang• 2021

Related benchmarks

Task	Dataset	Result
Image Classification	iNaturalist 2018	--	291
Image Classification	ImageNet LT	Top-1 Accuracy49.3	264
Image Classification	ImageNet-LT (test)	--	159
Image Classification	CIFAR100-LT (test)	--	65
Image Classification	CIFAR-10 LT-100 (test)	Error Rate20.11	40
Long-Tailed Image Classification	CIFAR-100-LT λ=100 (test)	--	21
Image Classification	CIFAR-10-LT	--	17
Long-Tailed Image Classification	CIFAR-100-LT λ=50 (test)	Error Rate46.16	12
Image Classification	CIFAR-10-LT λ=50 (test)	Error Rate0.1576	12
Long-Tailed Image Classification	CIFAR-100-LT λ=10 (test)	Error Rate37.62	11

Showing 10 of 15 rows

Other info

Code

Follow for update

@wizwand_team Discord