
Pruning Convolutional Neural Networks for Resource Efficient Inference

About

We propose a new formulation for pruning convolutional kernels in neural networks to enable efficient inference. We interleave greedy criteria-based pruning with fine-tuning by backpropagation, a computationally efficient procedure that maintains good generalization in the pruned network. We propose a new criterion based on Taylor expansion that approximates the change in the cost function induced by pruning network parameters. We focus on transfer learning, where large pretrained networks are adapted to specialized tasks. The proposed criterion demonstrates superior performance compared to other criteria, e.g. the norm of kernel weights or feature map activation, for pruning large CNNs after adaptation to fine-grained classification tasks (Birds-200 and Flowers-102), relying only on first-order gradient information. We also show that pruning can lead to more than 10x theoretical (5x practical) reduction in adapted 3D-convolutional filters with a small drop in accuracy in a recurrent gesture classifier. Finally, we show results for the large-scale ImageNet dataset to emphasize the flexibility of our approach.

Pavlo Molchanov, Stephen Tyree, Tero Karras, Timo Aila, Jan Kautz • 2016
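
The abstract describes a first-order Taylor criterion but does not spell out its formula. The sketch below is one illustrative reading, not the authors' implementation: it assumes a feature map's importance is the absolute value of the activation-times-gradient product averaged over the map, then averaged over examples. The data layout and the names `taylor_scores` and `least_important` are hypothetical, chosen only for this example.

```python
from typing import List, Set

# Hypothetical layout for this sketch: batch[n][c] is a flat list holding the
# spatial entries of feature map c for example n.
Batch = List[List[List[float]]]


def taylor_scores(activations: Batch, gradients: Batch) -> List[float]:
    """Estimate, to first order, how much the cost changes if a map is zeroed.

    Assumed form: |mean over positions of (activation * gradient)|,
    averaged over the examples in the batch.
    """
    num_examples = len(activations)
    num_maps = len(activations[0])
    scores = [0.0] * num_maps
    for n in range(num_examples):
        for c in range(num_maps):
            acts, grads = activations[n][c], gradients[n][c]
            mean_prod = sum(a * g for a, g in zip(acts, grads)) / len(acts)
            scores[c] += abs(mean_prod) / num_examples
    return scores


def least_important(scores: List[float], pruned: Set[int]) -> int:
    """Greedy step: pick the not-yet-pruned map with the smallest score."""
    candidates = [c for c in range(len(scores)) if c not in pruned]
    return min(candidates, key=lambda c: scores[c])


# Toy batch: one example, two feature maps with two spatial positions each.
activations = [[[1.0, 2.0], [0.5, 0.5]]]
gradients = [[[0.1, -0.1], [0.2, 0.2]]]
scores = taylor_scores(activations, gradients)
next_to_prune = least_important(scores, pruned=set())
```

In the interleaved procedure the abstract describes, a step like `least_important` would alternate with a few fine-tuning updates, and the scores would be recomputed before each subsequent pruning decision.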

Related benchmarks

Task                   Dataset                        Result               Rank
Language Modeling      WikiText2                      Perplexity 8.1274    3785
Language Modeling      C4                             Perplexity 24.2774   1688
Language Modeling      C4                             Perplexity 13.9008   1565
Commonsense Reasoning  WinoGrande                     --                   1442
Image Classification   MNIST (test)                   Accuracy 97.7        894
Question Answering     PIQA                           Accuracy 73.56       505
Question Answering     ARC Easy                       Accuracy 66.75       210
Reasoning              GSM8K                          --                   111
Action Recognition     HMDB51 (split 1)               Accuracy 34.84       80
Image Classification   CIFAR-10 8000 images (test)    Accuracy 91.1        36
Showing 10 of 19 rows
