Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Swift Sampler: Efficient Learning of Sampler by 10 Parameters

About

Data selection is essential for training deep learning models. An effective data sampler assigns proper sampling probability for training data and helps the model converge to a good local minimum with high performance. Previous studies in data sampling are mainly based on heuristic rules or learning through a huge amount of time-consuming trials. In this paper, we propose an automatic \textbf{swift sampler} search algorithm, \textbf{SS}, to explore automatically learning effective samplers efficiently. In particular, \textbf{SS} utilizes a novel formulation to map a sampler to a low dimension of hyper-parameters and uses an approximated local minimum to quickly examine the quality of a sampler. Benefiting from its low computational expense, \textbf{SS} can be applied on large-scale data sets with high efficiency. Comprehensive experiments on various tasks demonstrate that \textbf{SS} powered sampling can achieve obvious improvements (e.g., 1.5\% on ImageNet) and transfer among different neural networks. Project page: https://github.com/Alexander-Yao/Swift-Sampler.

Jiawei Yao, Chuming Li, Canran Xiao• 2024

Related benchmarks

TaskDatasetResultRank
Language ModelingWikiText-2 (test)
PPL21.7
1541
Image ClassificationImageNet ILSVRC-2012 (val)
Top-1 Accuracy84.3
405
Language ModelingWikiText2 (val)
Perplexity (PPL)22.4
277
Image ClassificationCIFAR-100 Noisy (test)
Top-1 Accuracy75.2
25
Image ClassificationCIFAR10 original and noisy labels (test)
Top-1 Acc93.8
25
Face VerificationYTF (test)
Verification Accuracy97.74
24
Few-shot Image Classificationmini-ImageNet 1-shot
Accuracy52.8
2
Few-shot Image Classificationmini-ImageNet 5-shot
Accuracy67.4
2
Few-shot Image ClassificationMini-ImageNet 10-shot
Accuracy73.4
2
Foundation Model TrainingLAION 500M 5B
Top-1 Accuracy74.7
2
Showing 10 of 10 rows

Other info

Code

Follow for update