Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Predicting Parameters in Deep Learning

About

We demonstrate that there is significant redundancy in the parameterization of several deep learning models. Given only a few weight values for each feature it is possible to accurately predict the remaining values. Moreover, we show that not only can the parameter values be predicted, but many of them need not be learned at all. We train several different architectures by learning only a small number of weights and predicting the rest. In the best case we are able to predict more than 95% of the weights of a network without any drop in accuracy.

Misha Denil, Babak Shakibi, Laurent Dinh, Marc'Aurelio Ranzato, Nando de Freitas• 2013

Related benchmarks

TaskDatasetResultRank
Commonsense ReasoningHellaSwag
Accuracy37.9
1460
Question AnsweringARC Challenge--
749
Commonsense ReasoningPIQA
Accuracy66.6
647
Question AnsweringARC Easy
Accuracy41.4
386
Question AnsweringSciQ--
226
Language ModelingLAMBADA
Accuracy32.4
183
Reading ComprehensionRACE
Accuracy28.9
151
Multi-task Language UnderstandingMMLU
Accuracy24.4
101
Language ModelingWikiText (val)
Perplexity33.28
34
Showing 9 of 9 rows

Other info

Follow for update