
MiLoRA: Harnessing Minor Singular Components for Parameter-Efficient LLM Finetuning

About

Efficient finetuning of large language models (LLMs) aims to adapt LLMs with reduced computational and memory cost. Previous LoRA-based approaches initialize the low-rank matrices with Gaussian-distributed and zero values while keeping the original weight matrices frozen. However, trainable parameters optimized in an unguided subspace might interfere with the well-learned subspace of the pretrained weight matrices. In this paper, we propose MiLoRA, a simple yet effective LLM finetuning approach that only updates the minor singular components of the weight matrix while keeping the principal singular components frozen. It is observed that the minor matrix corresponds to noisy or long-tail information, while the principal matrix contains important knowledge. MiLoRA initializes the low-rank matrices within a subspace that is orthogonal to the principal matrix, so the pretrained knowledge is expected to be well preserved. During finetuning, MiLoRA makes the most of the less-optimized subspace for learning from the labeled dataset. Extensive experiments on commonsense reasoning, math reasoning, instruction following, and visual instruction following benchmarks demonstrate the superior performance of our method.
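The initialization described above can be illustrated with a small NumPy sketch. It is not the authors' implementation: the function name `milora_style_init`, the toy matrix size, and the choice to split the square root of the minor singular values evenly between the two low-rank factors are all assumptions made for illustration, since the abstract does not specify these details.

```python
import numpy as np

def milora_style_init(W, r):
    """Split W into a frozen principal part and a trainable rank-r minor part.

    Hypothetical helper: the name and the sqrt split of the singular values
    are illustrative assumptions, not taken from the abstract.
    """
    # Singular values come back sorted in descending order.
    U, S, Vt = np.linalg.svd(W, full_matrices=False)
    k = S.shape[0] - r  # number of principal components kept frozen

    # Principal matrix: reconstructed from the k largest singular components.
    W_principal = (U[:, :k] * S[:k]) @ Vt[:k, :]

    # Low-rank factors built from the r smallest singular components.
    sqrt_s = np.sqrt(S[k:])
    B = U[:, k:] * sqrt_s            # shape (out_dim, r)
    A = sqrt_s[:, None] * Vt[k:, :]  # shape (r, in_dim)
    return W_principal, B, A

# Toy stand-in for a pretrained weight matrix.
rng = np.random.default_rng(0)
W = rng.standard_normal((8, 6))
W_principal, B, A = milora_style_init(W, r=2)

# At initialization the split reproduces the original weights.
print(np.allclose(W_principal + B @ A, W))
# The trainable factor starts in a subspace orthogonal to the principal matrix.
print(np.allclose(W_principal.T @ B, 0.0, atol=1e-10))
```

In this sketch only `B` and `A` would receive gradient updates during finetuning, while `W_principal` stays fixed, mirroring the frozen principal components described in the abstract.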

Hanqing Wang, Yixia Li, Shuo Wang, Guanhua Chen, Yun Chen • 2024

Related benchmarks

Task                               Dataset                       Result                  Rank
Code Generation                    HumanEval                     Pass@1 19.51            1036
Multi-task Language Understanding  MMLU                          Accuracy 17.74          876
Mathematical Reasoning             GSM8K (test)                  --                      770
Reasoning                          BBH                           Accuracy 25.14          672
Image Classification               EuroSAT                       Accuracy 98.19          569
Image Classification               SUN397                        Accuracy 45.38          441
Mathematical Reasoning             MATH (test)                   Overall Accuracy 17.76  433
Classification                     Cars                          Accuracy 39.77          395
Image Classification               RESISC45                      Accuracy 89.92          349
Commonsense Reasoning              Common Sense Reasoning Tasks  Avg Score 80            316
Showing 10 of 38 rows
