Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Learning Data Manipulation for Augmentation and Weighting

About

Manipulating data, such as weighting data examples or augmenting with new instances, has been increasingly used to improve model training. Previous work has studied various rule- or learning-based approaches designed for specific types of data manipulation. In this work, we propose a new method that supports learning different manipulation schemes with the same gradient-based algorithm. Our approach builds upon a recent connection of supervised learning and reinforcement learning (RL), and adapts an off-the-shelf reward learning algorithm from RL for joint data manipulation learning and model training. Different parameterization of the "data reward" function instantiates different manipulation schemes. We showcase data augmentation that learns a text transformation network, and data weighting that dynamically adapts the data sample importance. Experiments show the resulting algorithms significantly improve the image and text classification performance in low data regime and class-imbalance problems.

Zhiting Hu, Bowen Tan, Ruslan Salakhutdinov, Tom Mitchell, Eric P. Xing• 2019

Related benchmarks

TaskDatasetResultRank
Image ClassificationClothing1M (test)
Accuracy68.25
546
Text ClassificationSST-2 (test)
Accuracy82.25
185
Image ClassificationPlaces-LT (test)--
128
Text ClassificationSST-5 (test)
Accuracy40.14
58
Image ClassificationCIFAR-100 (test)
Accuracy (0% Noise)70.52
10
Image ClassificationCIFAR100 (test)
Accuracy (0% Noise)70.52
10
Image ClassificationCIFAR-10 (test)
Accuracy (0% Noise)92.7
10
Image ClassificationCIFAR10 (test)
Accuracy (0% Flip Noise)92.7
10
Image ClassificationLong-Tailed CIFAR10 mu=0.1 (test)
Accuracy86.31
8
Image ClassificationLong-Tailed CIFAR10 mu=0.01 (test)
Accuracy0.7027
8
Showing 10 of 14 rows

Other info

Follow for update