Generalized Large-Scale Data Condensation via Various Backbone and Statistical Matching

About

The lightweight "local-match-global" matching introduced by SRe2L successfully creates a distilled dataset with comprehensive information on the full 224x224 ImageNet-1k. However, this one-sided approach is limited to a particular backbone, layer, and statistics, which limits the improvement of the generalization of a distilled dataset. We suggest that sufficient and various "local-match-global" matching are more precise and effective than a single one and has the ability to create a distilled dataset with richer information and better generalization. We call this perspective "generalized matching" and propose Generalized Various Backbone and Statistical Matching (G-VBSM) in this work, which aims to create a synthetic dataset with densities, ensuring consistency with the complete dataset across various backbones, layers, and statistics. As experimentally demonstrated, G-VBSM is the first algorithm to obtain strong performance across both small-scale and large-scale datasets. Specifically, G-VBSM achieves a performance of 38.7% on CIFAR-100 with 128-width ConvNet, 47.6% on Tiny-ImageNet with ResNet18, and 31.4% on the full 224x224 ImageNet-1k with ResNet18, under images per class (IPC) 10, 50, and 10, respectively. These results surpass all SOTA methods by margins of 3.9%, 6.5%, and 10.1%, respectively.

Shitong Shao, Zeyuan Yin, Muxin Zhou, Xindong Zhang, Zhiqiang Shen• 2023

Related benchmarks

Task	Dataset	Result
Image Classification	CIFAR-100 (test)	Accuracy45.7	3518
Image Classification	CIFAR-10 (test)	Accuracy55.6	3381
Image Classification	ImageNet-1K 1.0 (val)	Top-1 Accuracy59.2	2386
Image Classification	ImageNet-1K	Top-1 Acc63.7	1239
Image Classification	ImageNet 1k (test)	Top-1 Accuracy63.7	939
Image Classification	Tiny ImageNet (test)	--	859
Image Classification	ImageNet-1K	--	600
Image Classification	CIFAR-10	--	507
Image Classification	ImageNet 1k (test)	Top-1 Accuracy56.1	490
Image Classification	ImageNet-1k 1.0 (test)	Top-1 Accuracy16.6	341

Showing 10 of 28 rows

Other info

Follow for update

@wizwand_team Discord