Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Pruner-Zero: Evolving Symbolic Pruning Metric from scratch for Large Language Models

About

Despite the remarkable capabilities, Large Language Models (LLMs) face deployment challenges due to their extensive size. Pruning methods drop a subset of weights to accelerate, but many of them require retraining, which is prohibitively expensive and computationally demanding. Recently, post-training pruning approaches introduced novel metrics, enabling the pruning of LLMs without retraining. However, these metrics require the involvement of human experts and tedious trial and error. To efficiently identify superior pruning metrics, we develop an automatic framework for searching symbolic pruning metrics using genetic programming. In particular, we devise an elaborate search space encompassing the existing pruning metrics to discover the potential symbolic pruning metric. We propose an opposing operation simplification strategy to increase the diversity of the population. In this way, Pruner-Zero allows auto-generation of symbolic pruning metrics. Based on the searched results, we explore the correlation between pruning metrics and performance after pruning and summarize some principles. Extensive experiments on LLaMA and LLaMA-2 on language modeling and zero-shot tasks demonstrate that our Pruner-Zero obtains superior performance than SOTA post-training pruning methods. Code at: \url{https://github.com/pprp/Pruner-Zero}.

Peijie Dong, Lujun Li, Zhenheng Tang, Xiang Liu, Xinglin Pan, Qiang Wang, Xiaowen Chu• 2024

Related benchmarks

TaskDatasetResultRank
Language ModelingWikiText-2 (test)
PPL7.41
2333
Language ModelingWikiText-2
Perplexity (PPL)10.52
2320
Commonsense ReasoningWinoGrande
Accuracy67.23
1442
Question AnsweringARC Challenge
Accuracy (ARC)35.5
598
Physical Interaction Question AnsweringPIQA
Accuracy73.23
415
Mathematical ReasoningMathQA
Accuracy25.12
354
Question AnsweringOpenBookQA
Accuracy27.8
305
Word Sense DisambiguationWiC
Avg Accuracy48.81
261
Logical reasoningLogiQA
LogiQA Accuracy21.78
251
Question AnsweringARC Easy
Accuracy69.23
210
Showing 10 of 34 rows

Other info

Follow for update