Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

AutoLoRA: Automatically Tuning Matrix Ranks in Low-Rank Adaptation Based on Meta Learning

About

Large-scale pretraining followed by task-specific finetuning has achieved great success in various NLP tasks. Since finetuning all parameters of large pretrained models poses substantial computational and memory challenges, several efficient finetuning methods have been developed. Among them, low-rank adaptation (LoRA), which finetunes low-rank incremental update matrices on top of frozen pretrained weights, has proven particularly effective. Nonetheless, LoRA's uniform rank assignment across all layers, along with its reliance on an exhaustive search to find the best rank, leads to high computation costs and suboptimal finetuning performance. To address these limitations, we introduce AutoLoRA, a meta learning based framework for automatically identifying the optimal rank of each LoRA layer. AutoLoRA associates each rank-1 matrix in a low-rank update matrix with a selection variable, which determines whether the rank-1 matrix should be discarded. A meta learning based method is developed to learn these selection variables. The optimal rank is determined by thresholding the values of these variables. Our comprehensive experiments on natural language understanding, generation, and sequence labeling demonstrate the effectiveness of AutoLoRA.

Ruiyi Zhang, Rushi Qiang, Sai Ashish Somayajula, Pengtao Xie• 2024

Related benchmarks

TaskDatasetResultRank
Language UnderstandingMMLU
Accuracy64.3
825
Commonsense ReasoningPIQA
Accuracy82.5
751
Science Question AnsweringARC Challenge
Accuracy59.6
342
Mathematical ReasoningGSM8K
Accuracy61.3
312
Question AnsweringOBQA
Accuracy81.7
300
Reading ComprehensionBoolQ
Accuracy80.3
279
Instruction FollowingMT-Bench--
215
Science Question AnsweringARC Easy
Accuracy76.9
155
Question AnsweringCommonsenseQA
Accuracy81.5
148
Mathematical ReasoningAQUA
Accuracy47.9
146
Showing 10 of 18 rows

Other info

Follow for update