Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment

About

While Low-Rank Adaptation (LoRA) enables parameter-efficient fine-tuning for Large Language Models (LLMs), its performance often falls short of Full Fine-Tuning (Full FT). Current methods optimize LoRA by initializing it with static singular value decomposition (SVD) subsets, which makes suboptimal use of pre-trained knowledge. Another path for improving LoRA is incorporating a Mixture-of-Experts (MoE) architecture. However, weight misalignment and complex gradient dynamics make it challenging to apply SVD priors to the LoRA MoE architecture. To mitigate these issues, we propose Great LoRA Mixture-of-Expert (GOAT), a framework that (1) adaptively integrates relevant priors using an SVD-structured MoE, and (2) aligns optimization with that of a fully fine-tuned MoE by deriving a theoretical scaling factor. We demonstrate that proper scaling, without modifying the architecture or training algorithms, boosts LoRA MoE's efficiency and performance. Experiments across 25 datasets, spanning natural language understanding, commonsense reasoning, image classification, and natural language generation, demonstrate GOAT's state-of-the-art performance, closing the gap with Full FT.
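As a rough illustration of what an "SVD-structured MoE" over LoRA experts could look like, the NumPy sketch below initializes each expert from a different segment of the pre-trained weight's SVD and mixes the selected experts with a router. It is not the authors' implementation. The helper names (`svd_expert_init`, `moe_lora_forward`), the evenly spaced segment choice, the softmax top-k router, and the free `scale` argument are all assumptions made for illustration; the paper derives a specific scaling factor that is not reproduced here.

```python
import numpy as np

def svd_expert_init(W, num_experts, rank):
    """Hypothetical helper: build one (B, A) LoRA pair per expert from the SVD of W.

    Each expert takes a different rank-`rank` segment of the spectrum.
    Spreading the segments evenly over the spectrum is an assumption.
    Requires num_experts * rank <= min(W.shape) so the segments do not overlap.
    """
    U, S, Vt = np.linalg.svd(W, full_matrices=False)
    starts = np.linspace(0, len(S) - rank, num_experts).astype(int)
    experts = []
    for s in starts:
        idx = slice(s, s + rank)
        sqrt_s = np.sqrt(S[idx])
        B = U[:, idx] * sqrt_s           # shape (d_out, rank)
        A = sqrt_s[:, None] * Vt[idx]    # shape (rank, d_in)
        experts.append((B, A))
    return experts

def moe_lora_forward(x, W, experts, router, top_k, scale):
    """Hypothetical forward pass: frozen W plus a scaled mixture of top-k experts."""
    logits = router @ x                        # one logit per expert
    top = np.argsort(logits)[-top_k:]          # indices of the top-k experts
    weights = np.exp(logits[top] - logits[top].max())
    weights /= weights.sum()                   # softmax over the selected experts
    delta = sum(w * (experts[i][0] @ (experts[i][1] @ x))
                for w, i in zip(weights, top))
    return W @ x + scale * delta               # `scale` is a free parameter here

rng = np.random.default_rng(0)
W = rng.standard_normal((8, 6))                # stand-in for a pre-trained weight
experts = svd_expert_init(W, num_experts=3, rank=2)
router = rng.standard_normal((3, 6))           # stand-in for a learned router
y = moe_lora_forward(rng.standard_normal(6), W, experts, router, top_k=2, scale=0.5)
```

In this toy setup the three rank-2 segments together cover the full spectrum of the 8x6 matrix, so the experts' `B @ A` products sum back to `W`. With real layer sizes the experts would cover only a small part of the spectrum.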

Chenghao Fan, Zhenyi Lu, Sichen Liu, Chengfeng Gu, Xiaoye Qu, Wei Wei, Yu Cheng • 2025

Related benchmarks

Task	Dataset	Result
Code Generation	HumanEval	Pass@1 25.61	1043
Image Classification	EuroSAT	Accuracy 98.82	569
Classification	Cars	Accuracy 53.5	492
Image Classification	RESISC45	Accuracy 93.46	472
Image Classification	SUN397	Accuracy 54.53	450
Commonsense Reasoning	Commonsense Reasoning (BoolQ, PIQA, SIQA, HellaS., WinoG., ARC-e, ARC-c, OBQA) (test)	BoolQ Accuracy 73.6	238
Image Classification	SVHN	Top-1 Accuracy 96.62	186
Image Classification	GTSRB	Accuracy 98.17	81
Dialogue	MT-Bench	MT-Bench Score 6.01	41
Natural Language Understanding	GLUE	MRPC Score 84.55	30

Showing 10 of 15 rows
