Accurate and Efficient Low-Rank Model Merging in Core Space
About
In this paper, we address the challenges associated with merging low-rank adaptations of large neural networks. With the rise of parameter-efficient adaptation techniques, such as Low-Rank Adaptation (LoRA), model fine-tuning has become more accessible. While fine-tuning models with LoRA is highly efficient, existing merging methods often sacrifice this efficiency by merging fully-sized weight matrices. We propose the Core Space merging framework, which enables the merging of LoRA-adapted models within a common alignment basis, thereby preserving the efficiency of low-rank adaptation while substantially improving accuracy across tasks. We further provide a formal proof that projection into Core Space ensures no loss of information and provide a complexity analysis showing the efficiency gains. Extensive empirical results demonstrate that Core Space significantly improves existing merging techniques and achieves state-of-the-art results on both vision and language tasks while utilizing a fraction of the computational resources. Codebase is available at https://github.com/apanariello4/core-space-merging.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Natural Language Inference | RTE | Accuracy92.03 | 590 | |
| Natural Language Inference | SNLI | Accuracy88.65 | 196 | |
| Image Classification | Vision Multi-task Suite (SUN397, Cars, RESISC45, EuroSAT, SVHN, GTSRB, MNIST, DTD) | Average Accuracy62.64 | 104 | |
| Visual Classification | 8 Vision Tasks (SUN397, Stanford Cars, RESISC45, EuroSAT, SVHN, GTSRB, MNIST, DTD) | Average Accuracy76.47 | 86 | |
| Natural Language Inference | SICK | Accuracy81.74 | 85 | |
| Natural Language Inference | QNLI | Accuracy79.34 | 78 | |
| Model Merging | Average of 8 benchmarks | Average Accuracy48.02 | 72 | |
| Natural Language Inference | SciTail | Accuracy94.45 | 26 | |
| Multi-domain evaluation | GSM8K, MATH, HumanEval, MBPP, FinanceBench, ConvFinQA, PubMedQA, and MedQA USMLE | Math Accuracy26.2 | 24 | |
| Natural Language Inference | NLP Benchmark (SNLI, MNLI, SICK, QNLI, RTE, SciTail) Llama-3-8B (val) | SNLI Accuracy95.84 | 12 |