Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Variational Low-Rank Adaptation Using IVON

About

We show that variational learning can significantly improve the accuracy and calibration of Low-Rank Adaptation (LoRA) without a substantial increase in the cost. We replace AdamW by the Improved Variational Online Newton (IVON) algorithm to finetune large language models. For Llama-2 with 7 billion parameters, IVON improves the accuracy over AdamW by 2.8% and expected calibration error by 4.6%. The accuracy is also better than the other Bayesian alternatives, yet the cost is lower and the implementation is easier. Our work provides additional evidence for the effectiveness of IVON for large language models. The code is available at https://github.com/team-approx-bayes/ivon-lora.

Bai Cong, Nico Daheim, Yuesong Shen, Daniel Cremers, Rio Yokota, Mohammad Emtiyaz Khan, Thomas M\"ollenhoff• 2024

Related benchmarks

TaskDatasetResultRank
RegressionUCI Concrete
Sharpness1.452
12
RegressionUCI Naval
Sharpness Score2.158
12
Time Series ForecastingETTh1 H=192 (test)
Sharpness1.59
6
Time Series ForecastingETTm1 H=192 (test)
Sharpness0.77
6
Time Series ForecastingETTh2 Length 96 (test)
Sharpness1.79
6
RegressionUCI Energy
Sharp/SA1.214
6
RegressionUCI Wine
Sharp/SA2.829
6
Time Series ForecastingETTh1 H=96 (test)
Sharpness0.4
6
Time Series ForecastingETTh2 H=192 (test)
Sharpness0.75
6
RegressionUCI Yacht
Sharpness (SA)1.471
6
Showing 10 of 28 rows

Other info

Follow for update