
VeRA: Vector-based Random Matrix Adaptation

About

Low-rank adaptation (LoRA) is a popular method that reduces the number of trainable parameters when finetuning large language models, but it still faces acute storage challenges when scaling to even larger models or when deploying numerous per-user or per-task adapted models. In this work, we present Vector-based Random Matrix Adaptation (VeRA), which significantly reduces the number of trainable parameters compared to LoRA, yet maintains the same performance. It achieves this by using a single pair of low-rank matrices shared across all layers and learning small scaling vectors instead. We demonstrate its effectiveness on the GLUE and E2E benchmarks and on image classification tasks, and show its application in instruction-tuning of 7B and 13B language models.
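The mechanism described in the abstract can be sketched as follows. This is a minimal NumPy illustration under our reading of the paper, not the authors' implementation; the class and variable names are ours. The shared matrices A and B are random and frozen, and only the per-layer scaling vectors d and b are trained, so each adapted layer adds just r + out_dim trainable parameters versus LoRA's r × (in_dim + out_dim).

```python
import numpy as np

class VeRALayer:
    """Sketch of a VeRA-adapted linear layer (illustrative, not the official code).

    A (r x in) and B (out x r) are frozen random matrices shared across
    all adapted layers; only the scaling vectors d and b are trainable.
    """
    def __init__(self, W0, A, B):
        self.W0 = W0                    # frozen pretrained weight, (out, in)
        self.A = A                      # frozen shared random matrix, (r, in)
        self.B = B                      # frozen shared random matrix, (out, r)
        self.d = np.ones(A.shape[0])    # trainable scaling vector, length r
        self.b = np.zeros(W0.shape[0])  # trainable; zero init so the delta starts at 0

    def forward(self, x):
        # h = W0 x + b ⊙ (B (d ⊙ (A x)))
        return self.W0 @ x + self.b * (self.B @ (self.d * (self.A @ x)))

rng = np.random.default_rng(0)
out_dim, in_dim, r = 8, 16, 4
W0 = rng.normal(size=(out_dim, in_dim))
A = rng.normal(size=(r, in_dim))    # one shared pair for every layer
B = rng.normal(size=(out_dim, r))
layer = VeRALayer(W0, A, B)
x = rng.normal(size=in_dim)
# With b initialized to zero, the adapted layer initially matches the base model:
assert np.allclose(layer.forward(x), W0 @ x)
```

In this sketch, setting b to zero at initialization keeps the adapted model identical to the pretrained one before training begins, mirroring LoRA's zero-initialized update.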

Dawid J. Kopiczko, Tijmen Blankevoort, Yuki M. Asano• 2023

Related benchmarks

| Task | Dataset | Metric | Result | Rank |
| --- | --- | --- | --- | --- |
| Question Answering | ARC Challenge | Accuracy | 81.02 | 906 |
| Commonsense Reasoning | PIQA | Accuracy | 78.63 | 751 |
| Question Answering | ARC Easy | Accuracy | 94.05 | 597 |
| Natural Language Understanding | GLUE | SST-2 | 93.89 | 531 |
| Natural Language Understanding | GLUE (test) | SST-2 Accuracy | 96.1 | 416 |
| Commonsense Reasoning | Common Sense Reasoning Tasks | Avg Score | 67.7 | 316 |
| Reading Comprehension | RACE high | Accuracy | 78.84 | 295 |
| Image Classification | GTSRB | Accuracy | 91.23 | 291 |
| Reading Comprehension | BoolQ | Accuracy | 85.85 | 279 |
| Image Classification | VTAB 1K | Overall Mean Accuracy | 69.9 | 258 |

Showing 10 of 40 rows.
