Mixture-of-Subspaces in Low-Rank Adaptation

About

In this paper, we introduce a subspace-inspired Low-Rank Adaptation (LoRA) method, which is computationally efficient, easy to implement, and readily applicable to large language, multimodal, and diffusion models. Initially, we equivalently decompose the weights of LoRA into two subspaces, and find that simply mixing them can enhance performance. To study such a phenomenon, we revisit it through a fine-grained subspace lens, showing that such modification is equivalent to employing a fixed mixer to fuse the subspaces. To be more flexible, we jointly learn the mixer with the original LoRA weights, and term the method Mixture-of-Subspaces LoRA (MoSLoRA). MoSLoRA consistently outperforms LoRA on tasks in different modalities, including commonsense reasoning, visual instruction tuning, and subject-driven text-to-image generation, demonstrating its effectiveness and robustness. Codes are available at https://github.com/wutaiqiang/MoSLoRA.

Taiqiang Wu, Jiahao Wang, Zhe Zhao, Ngai Wong• 2024

Related benchmarks

Task	Dataset	Result
Commonsense Reasoning	HellaSwag	Accuracy93.53	1896
Visual Question Answering	TextVQA	Accuracy50.2	1453
Multimodal Evaluation	MME	Score64.1	727
Natural Language Understanding	GLUE	SST-296.17	551
Multimodal Capability Evaluation	MM-Vet	Score35.2	393
Diagram Question Answering	AI2D	AI2D Accuracy66.1	387
Reading Comprehension	RACE high	Accuracy83.75	295
Commonsense Reasoning	Commonsense Reasoning (BoolQ, PIQA, SIQA, HellaS., WinoG., ARC-e, ARC-c, OBQA) (test)	BoolQ Accuracy74.6	238
Reading Comprehension	RACE mid	Accuracy86.13	196
Science Question Answering	ScienceQA SQA-IMG	Accuracy76	186

Showing 10 of 21 rows

Other info

Follow for update

@wizwand_team Discord