DoRA: Enhancing Parameter-Efficient Fine-Tuning with Dynamic Rank Distribution

About

Fine-tuning large-scale pre-trained models is inherently a resource-intensive task. While it can enhance the capabilities of the model, it also incurs substantial computational costs, posing challenges to the practical application of downstream tasks. Existing parameter-efficient fine-tuning (PEFT) methods such as Low-Rank Adaptation (LoRA) rely on a bypass framework that ignores the differential parameter budget requirements across weight matrices, which may lead to suboptimal fine-tuning outcomes. To address this issue, we introduce the Dynamic Low-Rank Adaptation (DoRA) method. DoRA decomposes high-rank LoRA layers into structured single-rank components, allowing for dynamic pruning of parameter budget based on their importance to specific tasks during training, which makes the most of the limited parameter budget. Experimental results demonstrate that DoRA can achieve competitive performance compared with LoRA and full model fine-tuning, and outperform various strong baselines with the same storage parameter budget. Our code is available at https://github.com/MIkumikumi0116/DoRA

Yulong Mao, Kaiyu Huang, Changhao Guan, Ganglin Bao, Fengran Mo, Jinan Xu• 2024

Related benchmarks

Task	Dataset	Result
Commonsense Reasoning	HellaSwag	Accuracy93.62	1896
Mathematical Reasoning	GSM8K (test)	Accuracy72.4	816
Natural Language Understanding	GLUE	SST-291.28	551
Classification	Cars	Accuracy66.63	492
Question Answering	SQuAD v1.1 (dev)	F1 Score92.24	380
Image Classification	Pets	Accuracy90.49	308
Reading Comprehension	RACE high	Accuracy83.39	295
Image Classification	CIFAR10	Accuracy (%)96.51	282
Commonsense Reasoning	Commonsense Reasoning (BoolQ, PIQA, SIQA, HellaS., WinoG., ARC-e, ARC-c, OBQA)	BoolQ Accuracy81.8	223
Reading Comprehension	RACE mid	Accuracy86.77	196

Showing 10 of 16 rows

Other info

Code

Follow for update

@wizwand_team Discord