ResizeMix: Mixing Data with Preserved Object Information and True Labels

About

Data augmentation is a powerful technique to increase the diversity of data, which can effectively improve the generalization ability of neural networks in image recognition tasks. Recent data mixing based augmentation strategies have achieved great success. Especially, CutMix uses a simple but effective method to improve the classifiers by randomly cropping a patch from one image and pasting it on another image. To further promote the performance of CutMix, a series of works explore to use the saliency information of the image to guide the mixing. We systematically study the importance of the saliency information for mixing data, and find that the saliency information is not so necessary for promoting the augmentation performance. Furthermore, we find that the cutting based data mixing methods carry two problems of label misallocation and object information missing, which cannot be resolved simultaneously. We propose a more effective but very easily implemented method, namely ResizeMix. We mix the data by directly resizing the source image to a small patch and paste it on another image. The obtained patch preserves more substantial object information compared with conventional cut-based methods. ResizeMix shows evident advantages over CutMix and the saliency-guided methods on both image classification and object detection tasks without additional computation cost, which even outperforms most costly search-based automatic augmentation methods.

Jie Qin, Jiemin Fang, Qian Zhang, Wenyu Liu, Xingang Wang, Xinggang Wang• 2020

Related benchmarks

Task	Dataset	Result
Object Hallucination Evaluation	POPE	--	2019
Image Classification	ImageNet-1k (val)	Top-1 Accuracy81.64	1498
Image Classification	Tiny ImageNet (test)	Accuracy65.87	722
Image Classification	CIFAR-100	--	691
Fine-grained Image Classification	Stanford Cars (test)	Accuracy91.59	372
Image Classification	Stanford Cars (test)	--	320
Fine-grained visual classification	FGVC-Aircraft (test)	Top-1 Acc77.62	312
Image Classification	CIFAR-100	--	302
Image Classification	iNaturalist 2018	Top-1 Accuracy69.3	291
Object Detection	COCO	AP50 (Box)59.4	237

Showing 10 of 26 rows

Other info

Follow for update

@wizwand_team Discord