Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

OCD: Learning to Overfit with Conditional Diffusion Models

About

We present a dynamic model in which the weights are conditioned on an input sample x and are learned to match those that would be obtained by finetuning a base model on x and its label y. This mapping between an input sample and network weights is approximated by a denoising diffusion model. The diffusion model we employ focuses on modifying a single layer of the base model and is conditioned on the input, activations, and output of this layer. Since the diffusion model is stochastic in nature, multiple initializations generate different networks, forming an ensemble, which leads to further improvements. Our experiments demonstrate the wide applicability of the method for image classification, 3D reconstruction, tabular data, speech separation, and natural language processing. Our code is available at https://github.com/ShaharLutatiPersonal/OCD

Shahar Lutati, Lior Wolf• 2022

Related benchmarks

TaskDatasetResultRank
Image ClassificationMNIST (test)--
882
Image ClassificationTinyImageNet (test)--
366
Image ClassificationTinyImageNet
Accuracy92
108
Text ClassificationSST-5 (test)
Accuracy48.6
58
RegressionCalifornia Housing (CH) (test)
MSE0.48
52
Sentence ClassificationCR (test)
Accuracy91.2
33
Audio SeparationLibri5Mix (test)
SI-SDRi (dB)13.9
6
Text ClassificationAmazonCF (test)
Accuracy41.2
5
RegressionMicrosoft LETOR 4.0 (test)
MSE0.743
5
Text ClassificationEmotion (test)
Accuracy50.5
5
Showing 10 of 14 rows

Other info

Follow for update