Bayesian Low-rank Adaptation for Large Language Models
About
Low-rank adaptation (LoRA) has emerged as a new paradigm for cost-efficient fine-tuning of large language models (LLMs). However, fine-tuned LLMs often become overconfident, especially when fine-tuned on small datasets. Bayesian methods, with their inherent ability to estimate uncertainty, serve as potent tools to mitigate overconfidence and enhance calibration. In this work, we introduce Laplace-LoRA, which takes a Bayesian approach to the LoRA parameters: specifically, it applies a Laplace approximation to the posterior over the LoRA parameters, considerably improving the calibration of fine-tuned LLMs.
Adam X. Yang, Maxime Robeyns, Xi Wang, Laurence Aitchison • 2023
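
The abstract leaves the mechanics implicit, so below is a minimal, hypothetical sketch of the core idea: fine-tune only the LoRA parameters to a MAP estimate, fit a Gaussian (Laplace) approximation to their posterior using a simple diagonal empirical-Fisher curvature estimate, then average softmax predictions over posterior samples. Everything here (`LoRALinear`, `predictive`, the toy data) is illustrative rather than the authors' code, and the paper itself uses a richer curvature approximation than the diagonal one shown.

```python
# Hypothetical minimal sketch of the Laplace-LoRA idea, not the authors' code:
# MAP-fit LoRA adapters, place a diagonal Laplace approximation over the LoRA
# parameters only, and Monte Carlo average the softmax over posterior samples.
import torch
import torch.nn as nn
import torch.nn.functional as F

torch.manual_seed(0)

class LoRALinear(nn.Module):
    """A frozen linear layer with a trainable low-rank update W + B @ A."""
    def __init__(self, d_in, d_out, rank=4):
        super().__init__()
        self.weight = nn.Parameter(torch.randn(d_out, d_in) / d_in**0.5,
                                   requires_grad=False)  # frozen "pretrained" weight
        self.A = nn.Parameter(torch.randn(rank, d_in) * 0.01)  # LoRA down-projection
        self.B = nn.Parameter(torch.zeros(d_out, rank))        # LoRA up-projection

    def forward(self, x):
        return x @ (self.weight + self.B @ self.A).T

# Toy classification data standing in for a small fine-tuning set.
X = torch.randn(256, 16)
y = (X[:, 0] > 0).long()
model = LoRALinear(16, 2)
lora_params = [model.A, model.B]

# 1) MAP fit of the LoRA parameters (weight decay acts as a Gaussian prior).
opt = torch.optim.Adam(lora_params, lr=1e-2, weight_decay=1e-3)
for _ in range(200):
    opt.zero_grad()
    F.cross_entropy(model(X), y).backward()
    opt.step()

# 2) Diagonal Laplace approximation: posterior precision is approximated by the
#    prior precision plus a diagonal empirical-Fisher estimate of curvature.
prior_precision = 1.0
fisher = [torch.zeros_like(p) for p in lora_params]
for xi, yi in zip(X, y):
    model.zero_grad()
    F.cross_entropy(model(xi[None]), yi[None]).backward()
    for f, p in zip(fisher, lora_params):
        f += p.grad ** 2  # accumulate squared per-example gradients
post_var = [1.0 / (prior_precision + f) for f in fisher]

# 3) Predict by averaging softmax outputs over samples from the Gaussian posterior.
@torch.no_grad()
def predictive(x, n_samples=32):
    maps = [p.detach().clone() for p in lora_params]
    probs = 0.0
    for _ in range(n_samples):
        for p, m, v in zip(lora_params, maps, post_var):
            p.copy_(m + v.sqrt() * torch.randn_like(m))  # sample LoRA params
        probs = probs + F.softmax(model(x), dim=-1)
    for p, m in zip(lora_params, maps):  # restore the MAP values
        p.copy_(m)
    return probs / n_samples

print(predictive(X[:4]))
```

Because only the low-rank adapters are treated probabilistically, the curvature estimate and posterior sampling stay cheap even when the frozen base model is large; that is the practical appeal of restricting the Laplace approximation to the LoRA parameters.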
Related benchmarks
| Task | Dataset | Metric | Result | Rank |
|---|---|---|---|---|
| Image Classification | DTD | -- | -- | 487 |
| Image Classification | Flowers102 | Accuracy | 91.5 | 478 |
| Commonsense Reasoning | ARC Challenge | Accuracy | 66.9 | 132 |
| Commonsense Reasoning | OBQA | Accuracy | 82.12 | 75 |
| Commonsense Reasoning | ARC-E | Accuracy | 85.4 | 62 |
| Commonsense Reasoning | ARC-C | Accuracy | 66.78 | 51 |
| Paraphrase Detection | MRPC GLUE (val) | Accuracy | 0.8652 | 27 |
| Natural Language Inference | RTE (val) | Accuracy | 0.7605 | 24 |
| Commonsense Reasoning | WG-M | Accuracy | 75.55 | 18 |
| Commonsense Reasoning | WG-S | Accuracy | 69.2 | 18 |
*Showing 10 of 31 rows.*