Imagine by Reasoning: A Reasoning-Based Implicit Semantic Data Augmentation for Long-Tailed Classification

About

Real-world data often follows a long-tailed distribution, which heavily degrades the performance of existing classification algorithms. A key issue is that the samples in tail categories fail to capture the intra-class diversity of those categories. Humans can imagine a sample in new poses, scenes, and view angles using prior knowledge, even when seeing a category for the first time. Inspired by this, we propose a novel reasoning-based implicit semantic data augmentation method that borrows transformation directions from other classes. Since the covariance matrix of each category represents its feature transformation directions, we can sample new directions from similar categories to generate distinctly different instances. Specifically, the long-tailed data is first used to train a backbone and a classifier. Then, a covariance matrix is estimated for each category, and a knowledge graph is constructed to store the relations between every pair of categories. Finally, tail samples are adaptively enhanced by propagating information from all similar categories in the knowledge graph. Experimental results on CIFAR-100-LT, ImageNet-LT, and iNaturalist 2018 demonstrate the effectiveness of the proposed method compared with state-of-the-art methods.

Xiaohua Chen, Yucan Zhou, Dayan Wu, Wanqian Zhang, Yu Zhou, Bo Li, Weiping Wang • 2021
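
The idea in the abstract of borrowing transformation directions from similar categories can be illustrated with a small NumPy sketch. This is not the authors' implementation. It assumes cosine similarity between class means as a stand-in for the knowledge-graph relations, a fixed blending weight `alpha`, and explicit sampling of new feature vectors (the paper describes an implicit augmentation, which this sketch does not reproduce). All function and parameter names (`class_statistics`, `borrowed_covariance`, `augment`, `k`, `alpha`, `strength`) are hypothetical.

```python
import numpy as np

def class_statistics(features, labels, num_classes):
    """Estimate a mean vector and covariance matrix per class from deep features."""
    dim = features.shape[1]
    means = np.zeros((num_classes, dim))
    covs = np.zeros((num_classes, dim, dim))
    for c in range(num_classes):
        x = features[labels == c]
        means[c] = x.mean(axis=0)
        if len(x) > 1:  # a class with a single sample keeps a zero covariance
            covs[c] = np.cov(x, rowvar=False)
    return means, covs

def borrowed_covariance(c, means, covs, k=2, alpha=0.5):
    """Blend class c's covariance with those of its k most similar classes.

    Assumption: similarity is the cosine similarity between class means,
    and neighbor weights are a softmax over those similarities.
    """
    unit = means / (np.linalg.norm(means, axis=1, keepdims=True) + 1e-12)
    sim = unit @ unit[c]
    sim[c] = -np.inf                      # exclude the class itself
    neighbors = np.argsort(sim)[-k:]      # indices of the k most similar classes
    weights = np.exp(sim[neighbors])
    weights /= weights.sum()
    borrowed = np.tensordot(weights, covs[neighbors], axes=1)
    return alpha * covs[c] + (1.0 - alpha) * borrowed

def augment(x, cov, n_new, strength=0.5, rng=None):
    """Draw n_new perturbed copies of feature x along directions encoded in cov."""
    rng = np.random.default_rng() if rng is None else rng
    noise = rng.multivariate_normal(np.zeros(len(x)), strength * cov, size=n_new)
    return x + noise
```

Under these assumptions, a tail class with very few samples still receives a non-degenerate covariance, because most of its augmentation directions come from better-populated neighbors. The choice of `k`, `alpha`, and `strength` here is arbitrary and would need tuning.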

Related benchmarks

| Task | Dataset | Result | Rank |
|---|---|---|---|
| Image Classification | iNaturalist 2018 | -- | 287 |
| Image Classification | ImageNet LT | Top-1 Accuracy: 49.3 | 251 |
| Image Classification | ImageNet-LT (test) | -- | 159 |
| Image Classification | CIFAR100-LT (test) | Top-1 Acc (IR=100): 50.16 | 45 |
| Long-Tailed Image Classification | CIFAR-100-LT λ=100 (test) | -- | 21 |
| Long-Tailed Image Classification | CIFAR-100-LT λ=50 (test) | Error Rate: 46.16 | 12 |
| Image Classification | CIFAR-10-LT λ=50 (test) | Error Rate: 0.1576 | 12 |
| Image Classification | CIFAR-10 LT-100 (test) | Error Rate: 20.11 | 12 |
| Long-Tailed Image Classification | CIFAR-100-LT λ=10 (test) | Error Rate: 37.62 | 11 |
| Image Classification | CIFAR-10-LT λ=10 (test) | Error Rate: 10.64 | 11 |
Showing 10 of 15 rows
