Variational Low-Rank Adaptation Using IVON

About

We show that variational learning can significantly improve the accuracy and calibration of Low-Rank Adaptation (LoRA) without a substantial increase in the cost. We replace AdamW by the Improved Variational Online Newton (IVON) algorithm to finetune large language models. For Llama-2 with 7 billion parameters, IVON improves the accuracy over AdamW by 2.8% and expected calibration error by 4.6%. The accuracy is also better than the other Bayesian alternatives, yet the cost is lower and the implementation is easier. Our work provides additional evidence for the effectiveness of IVON for large language models. The code is available at https://github.com/team-approx-bayes/ivon-lora.

Bai Cong, Nico Daheim, Yuesong Shen, Daniel Cremers, Rio Yokota, Mohammad Emtiyaz Khan, Thomas M\"ollenhoff• 2024

Related benchmarks

Task	Dataset	Result
Regression	UCI Concrete	Sharpness1.452	12
Regression	UCI Naval	Sharpness Score2.158	12
Time Series Forecasting	ETTh1 H=192 (test)	Sharpness1.59	6
Time Series Forecasting	ETTm1 H=192 (test)	Sharpness0.77	6
Time Series Forecasting	ETTh2 Length 96 (test)	Sharpness1.79	6
Regression	UCI Energy	Sharp/SA1.214	6
Regression	UCI Wine	Sharp/SA2.829	6
Time Series Forecasting	ETTh1 H=96 (test)	Sharpness0.4	6
Time Series Forecasting	ETTh2 H=192 (test)	Sharpness0.75	6
Regression	UCI Yacht	Sharpness (SA)1.471	6

Showing 10 of 28 rows

Other info

Follow for update

@wizwand_team Discord