VB-LoRA: Extreme Parameter Efficient Fine-Tuning with Vector Banks
About
As the adoption of large language models increases and the need for per-user or per-task model customization grows, parameter-efficient fine-tuning (PEFT) methods, such as low-rank adaptation (LoRA) and its variants, incur substantial storage and transmission costs. To further reduce stored parameters, we introduce a "divide-and-share" paradigm that breaks the barriers of low-rank decomposition across matrix dimensions, modules, and layers by sharing parameters globally via a vector bank. As an instantiation of the paradigm to LoRA, our proposed VB-LoRA composes all the low-rank matrices of LoRA from a shared vector bank with a differentiable top-k admixture module. VB-LoRA achieves extreme parameter efficiency while maintaining performance comparable to or better than that of state-of-the-art PEFT methods. Extensive experiments demonstrate the effectiveness of VB-LoRA on natural language understanding, natural language generation, instruction tuning, and mathematical reasoning tasks. When fine-tuning the Llama2-13B model, VB-LoRA uses only 0.4% of LoRA's stored parameters, yet achieves superior results. Our source code is available at https://github.com/leo-yangli/VB-LoRA. This method has been merged into the Hugging Face PEFT package.
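To make the "divide-and-share" idea concrete, the following is a minimal, illustrative PyTorch sketch of composing a single sub-vector from a shared vector bank with a top-k admixture. The module and variable names (`TopKAdmixtureSubvector`, `vector_bank`, `num_vectors`, `vector_length`, `topk`) are our own for this sketch, not taken from the official code; see the repository above or the PEFT documentation for the actual implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class TopKAdmixtureSubvector(nn.Module):
    """Illustrative sketch: compose one sub-vector as a sparse admixture of
    vectors drawn from a globally shared bank (assumed shapes and names)."""

    def __init__(self, num_vectors: int, topk: int = 2):
        super().__init__()
        self.topk = topk
        # One learnable logit per bank vector; the top-k logits decide which
        # bank vectors contribute to this sub-vector.
        self.logits = nn.Parameter(torch.zeros(num_vectors))

    def forward(self, vector_bank: torch.Tensor) -> torch.Tensor:
        # vector_bank: (num_vectors, vector_length), shared across modules/layers.
        topk_vals, topk_idx = torch.topk(self.logits, self.topk)
        weights = F.softmax(topk_vals, dim=-1)             # mixture weights over the selected vectors
        selected = vector_bank[topk_idx]                   # (topk, vector_length)
        return (weights.unsqueeze(-1) * selected).sum(0)   # (vector_length,)


if __name__ == "__main__":
    # Toy sizes for illustration only: 8 bank vectors of length 4, top-2 admixture.
    bank = nn.Parameter(torch.randn(8, 4))
    sub = TopKAdmixtureSubvector(num_vectors=8, topk=2)
    print(sub(bank).shape)  # torch.Size([4])
```

In the full method, each low-rank LoRA factor is tiled from many such sub-vectors, all drawn from the same bank, so only the bank, the selection logits, and the mixture weights need to be stored. The Hugging Face PEFT package exposes this through a VB-LoRA adapter config; consult the PEFT documentation for the exact class and parameter names.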
Related benchmarks
| Task | Dataset | Metric | Result | Rank |
|---|---|---|---|---|
| Mathematical Reasoning | GSM8K (test) | Accuracy | 75.96 | 751 |
| Mathematical Reasoning | MATH (test) | Overall Accuracy | 28.9 | 433 |
| Natural Language Generation | E2E (test) | ROUGE-L | 72.2 | 79 |
| Natural Language Understanding | GLUE | CoLA Score | 69.3 | 41 |
| Mathematical Reasoning | GSM8K and MATH | GSM8K Score | 75.96 | 27 |
| Instruction Following | MT-Bench, GPT-4 scored (test) | Score | 6.31 | 10 |
| Automatic Speech Recognition | FLEURS Unseen Languages | WER | 0.5075 | 8 |
| Automatic Speech Recognition | FLEURS Seen-Weak | WER | 28 | 8 |
| Automatic Speech Recognition | Common Voice Unseen Languages | WER | 59.81 | 8 |