
HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning

About

Adapting Large Language Models (LLMs) to new tasks through fine-tuning has been made more efficient by Parameter-Efficient Fine-Tuning (PEFT) techniques such as LoRA. However, these methods often underperform full fine-tuning, and the gap widens on complex, heterogeneous datasets and domains, highlighting the need for PEFT approaches that achieve better performance. Through a series of experiments, we uncovered two critical insights that shed light on the training and parameter inefficiency of LoRA. Building on these insights, we developed HydraLoRA, a LoRA framework with an asymmetric structure that eliminates the need for domain expertise. Our experiments demonstrate that HydraLoRA outperforms other PEFT approaches, even those that rely on domain knowledge during the training and inference phases.
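The "asymmetric structure" refers to splitting the standard LoRA update BA into a single shared down-projection A and multiple up-projection heads B whose contributions are mixed by a learned router. The PyTorch sketch below is a minimal illustration of that idea, assuming a token-level softmax router; the class name `HydraLoRALinear`, the hyperparameters, and the gating details are our assumptions for exposition, not the authors' reference code.

```python
import math
import torch
import torch.nn as nn
import torch.nn.functional as F

class HydraLoRALinear(nn.Module):
    """Illustrative asymmetric LoRA layer: one shared down-projection A,
    several up-projection heads B, mixed per token by a learned router.
    A sketch of the idea in the abstract, not the reference implementation."""

    def __init__(self, in_features, out_features, rank=8, num_heads=3, alpha=16.0):
        super().__init__()
        # Frozen pretrained weight W0 (stand-in for the base model's linear layer)
        self.base = nn.Linear(in_features, out_features, bias=False)
        self.base.weight.requires_grad_(False)
        # Shared A: one low-rank down-projection used by every head
        self.lora_A = nn.Parameter(torch.empty(rank, in_features))
        nn.init.kaiming_uniform_(self.lora_A, a=math.sqrt(5))
        # Asymmetry: num_heads separate B up-projections, zero-initialized
        # so the layer starts out identical to the frozen base layer
        self.lora_B = nn.Parameter(torch.zeros(num_heads, out_features, rank))
        # Router produces a soft mixture over the B heads for each token
        self.router = nn.Linear(in_features, num_heads, bias=False)
        self.scaling = alpha / rank

    def forward(self, x):
        # x: (batch, seq, in_features)
        h = self.base(x)
        z = x @ self.lora_A.T                      # (batch, seq, rank)
        gates = F.softmax(self.router(x), dim=-1)  # (batch, seq, num_heads)
        # Apply each B head, then take the gate-weighted sum of the heads
        heads = torch.einsum("bsr,hor->bsho", z, self.lora_B)
        return h + self.scaling * torch.einsum("bsh,bsho->bso", gates, heads)

layer = HydraLoRALinear(64, 64, rank=8, num_heads=3)
out = layer(torch.randn(2, 16, 64))
print(out.shape)  # torch.Size([2, 16, 64])
```

Because all heads share A, the trainable-parameter count grows with the number of heads only through the small B matrices and the router, which is where the parameter efficiency over training several independent LoRA modules would come from.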

Chunlin Tian, Zhan Shi, Zhijiang Guo, Li Li, Chengzhong Xu · 2024

Related benchmarks

| Task | Dataset | Metric | Result | Rank |
|---|---|---|---|---|
| Code Generation | HumanEval | Pass@1 | 24.21 | 1036 |
| Mathematical Reasoning | GSM8K (test) | Accuracy | 62.8 | 900 |
| Reasoning | BBH | Accuracy | 41.17 | 672 |
| Image Classification | EuroSAT | Accuracy | 98.4 | 569 |
| Code Generation | HumanEval (test) | Pass@1 | 46.89 | 506 |
| Mathematical Reasoning | MATH 500 | Accuracy | 14.8 | 442 |
| Image Classification | SUN397 | Accuracy | 51.8 | 441 |
| Mathematical Reasoning | SVAMP | Accuracy | 71.33 | 403 |
| Classification | Cars | Accuracy | 48.42 | 395 |
| Image Classification | RESISC45 | Accuracy | 92.93 | 349 |
Showing 10 of 82 rows
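A note on the metrics: the Accuracy entries are plain percentages, while Pass@1 on HumanEval is conventionally computed with the unbiased pass@k estimator introduced alongside HumanEval (Chen et al., 2021). A minimal sketch follows; the sample counts in the example are made up for illustration, not taken from the table.

```python
from math import comb

def pass_at_k(n: int, c: int, k: int) -> float:
    """Unbiased pass@k estimator (Chen et al., 2021): the probability
    that at least one of k samples drawn from n generations, of which
    c pass the unit tests, is correct."""
    if n - c < k:
        return 1.0  # fewer failures than draws: some draw must pass
    return 1.0 - comb(n - c, k) / comb(n, k)

# Hypothetical run: 100 generations per problem, 24 passing
print(round(100 * pass_at_k(100, 24, 1), 2))  # 24.0 (a Pass@1 percentage)
```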
