Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

ResizeMix: Mixing Data with Preserved Object Information and True Labels

About

Data augmentation is a powerful technique to increase the diversity of data, which can effectively improve the generalization ability of neural networks in image recognition tasks. Recent data mixing based augmentation strategies have achieved great success. Especially, CutMix uses a simple but effective method to improve the classifiers by randomly cropping a patch from one image and pasting it on another image. To further promote the performance of CutMix, a series of works explore to use the saliency information of the image to guide the mixing. We systematically study the importance of the saliency information for mixing data, and find that the saliency information is not so necessary for promoting the augmentation performance. Furthermore, we find that the cutting based data mixing methods carry two problems of label misallocation and object information missing, which cannot be resolved simultaneously. We propose a more effective but very easily implemented method, namely ResizeMix. We mix the data by directly resizing the source image to a small patch and paste it on another image. The obtained patch preserves more substantial object information compared with conventional cut-based methods. ResizeMix shows evident advantages over CutMix and the saliency-guided methods on both image classification and object detection tasks without additional computation cost, which even outperforms most costly search-based automatic augmentation methods.

Jie Qin, Jiemin Fang, Qian Zhang, Wenyu Liu, Xingang Wang, Xinggang Wang• 2020

Related benchmarks

TaskDatasetResultRank
Image ClassificationImageNet-1k (val)
Top-1 Accuracy81.64
1469
Object Hallucination EvaluationPOPE--
1455
Image ClassificationCIFAR-100--
691
Image ClassificationTiny ImageNet (test)
Accuracy65.87
362
Fine-grained Image ClassificationStanford Cars (test)
Accuracy91.59
348
Image ClassificationStanford Cars (test)--
316
Fine-grained visual classificationFGVC-Aircraft (test)
Top-1 Acc77.62
312
Image ClassificationCIFAR-100--
302
Image ClassificationiNaturalist 2018
Top-1 Accuracy69.3
291
Object DetectionCOCO
AP50 (Box)59.4
237
Showing 10 of 26 rows

Other info

Follow for update