
3-in-1: 2D Rotary Adaptation for Efficient Finetuning, Efficient Batching and Composability

About

Parameter-efficient finetuning (PEFT) methods effectively adapt large language models (LLMs) to diverse downstream tasks, reducing storage and GPU memory demands. Despite these advantages, several applications pose new challenges to PEFT beyond mere parameter efficiency. One notable challenge involves the efficient deployment of LLMs equipped with multiple task- or user-specific adapters, particularly when different adapters are needed for distinct requests within the same batch. Another challenge is the interpretability of LLMs, which is crucial for understanding how LLMs function. Previous studies introduced various approaches to address different challenges. In this paper, we introduce a novel method, RoAd, which employs a straightforward 2D rotation to adapt LLMs and addresses all the above challenges: (1) RoAd is remarkably parameter-efficient, delivering optimal performance on GLUE, eight commonsense reasoning tasks and four arithmetic reasoning tasks with $<0.1\%$ trainable parameters; (2) RoAd facilitates the efficient serving of requests requiring different adapters within a batch, with an overhead comparable to element-wise multiplication instead of batch matrix multiplication; (3) RoAd enhances LLMs' interpretability through integration within a framework of distributed interchange intervention, demonstrated via composition experiments.
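The core idea — adapting hidden dimensions in pairs with learned 2D rotations, so that each request in a batch can carry its own adapter using only element-wise multiplications — can be sketched as follows. This is a minimal NumPy illustration, not the authors' implementation; the function name `road_apply` and the adjacent-channel pairing scheme are assumptions for demonstration.

```python
import numpy as np

def road_apply(h, theta):
    """Apply per-request 2D rotations to paired hidden channels.

    h:     (batch, d) hidden states
    theta: (batch, d // 2) rotation angles, one set per request,
           so every request in the batch can use a different adapter.

    Each channel pair (h1, h2) is rotated by its angle t:
        [h1'; h2'] = [[cos t, -sin t],
                      [sin t,  cos t]] @ [h1; h2]
    Only element-wise multiplies and adds are needed — no batch
    matrix multiplication — which is what makes heterogeneous
    adapters within one batch cheap to serve.
    """
    h1, h2 = h[:, 0::2], h[:, 1::2]          # adjacent channels form a pair
    cos, sin = np.cos(theta), np.sin(theta)
    out = np.empty_like(h)
    out[:, 0::2] = cos * h1 - sin * h2
    out[:, 1::2] = sin * h1 + cos * h2
    return out

# Two requests in one batch, each with its own (hypothetical) adapter angles.
batch = np.random.randn(2, 8)
angles = np.random.randn(2, 4)               # request-specific parameters
adapted = road_apply(batch, angles)
```

With all angles set to zero the rotation is the identity, so the adapted model starts from the pretrained behavior — a convenient initialization property of rotation-based adapters.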

Baohao Liao, Christof Monz • 2024

Related benchmarks

Task | Dataset | Metric | Result | Rank
Natural Language Understanding | GLUE (dev) | SST-2 (Acc) | 96.3 | 518
Instruction Following | AlpacaEval 2.0 | Win Rate | 62.64 | 507
Image Classification | VTAB-1K | Overall Mean Accuracy | 57.5 | 258
Commonsense Reasoning | Commonsense Reasoning (BoolQ, PIQA, SIQA, HellaSwag, WinoGrande, ARC-e, ARC-c, OBQA) (test) | BoolQ Accuracy | 73.3 | 202
Commonsense Reasoning | Commonsense Reasoning (BoolQ, PIQA, SIQA, HellaSwag, WinoGrande, ARC-e, ARC-c, OBQA) | BoolQ Accuracy | 74.6 | 129
Arithmetic Reasoning | GSM8K (test) | Accuracy | 40.7 | 129
Mathematical Reasoning | MAWPS (test) | Accuracy | 84.9 | 87
Arithmetic Reasoning | AQuA (test) | Accuracy | 26.8 | 58
Arithmetic Reasoning | SVAMP (test) | Accuracy | 59.5 | 54
Visual Instruction Tuning | LLaVA 1.5 Evaluation Suite (GQA, SQA, VQAT, POPE) (test) | GQA | 62.5 | 3
