PLoP: Precise LoRA Placement for Efficient Finetuning of Large Models

About

Low-Rank Adaptation (LoRA) is a widely used finetuning method for large models. Its small memory footprint allows practitioners to adapt large models to specific tasks at a fraction of the cost of full finetuning. Various modifications have been proposed to improve its efficiency, for example in how the learning rate, the rank, and the initialization are set. Another axis of improvement is the adapter placement strategy: when using LoRA, practitioners usually pick the module types to adapt, such as the Query and Key modules. Few works have studied the problem of adapter placement, and their results are inconclusive: the original LoRA paper suggested placing adapters in attention modules, while other works suggested placing them in the MLP modules. Through an intuitive theoretical analysis, we introduce PLoP (Precise LoRA Placement), a lightweight method that automatically identifies the module types where LoRA adapters should be placed, given a pretrained model and a finetuning task. Through comprehensive experiments on supervised finetuning and reinforcement learning for reasoning, we demonstrate that PLoP consistently outperforms commonly used placement strategies, or in the worst case is competitive with them.
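To make the notion of "adapter placement" concrete, here is a minimal NumPy sketch of attaching LoRA factors to selected module types only. It does not implement PLoP's selection criterion, which the abstract does not specify. The toy model, the module names (`q_proj`, `up_proj`, and so on), and the helpers `init_lora`, `place_adapters`, and `trainable_params` are illustrative assumptions, not part of the paper.

```python
import numpy as np

def init_lora(weight, rank, rng):
    """Return LoRA factors (A, B) for a weight of shape (d_out, d_in).

    B starts at zero, so the adapted weight W + B @ A equals W before training.
    """
    d_out, d_in = weight.shape
    A = rng.normal(0.0, 1.0 / np.sqrt(d_in), size=(rank, d_in))
    B = np.zeros((d_out, rank))
    return A, B

def place_adapters(weights, target_types, rank, seed=0):
    """Attach LoRA factors only to modules whose type is in target_types.

    The module type is taken to be the last dot-separated part of the name
    (a naming convention assumed for this sketch).
    """
    rng = np.random.default_rng(seed)
    adapters = {}
    for name, W in weights.items():
        module_type = name.rsplit(".", 1)[-1]
        if module_type in target_types:
            adapters[name] = init_lora(W, rank, rng)
    return adapters

def trainable_params(adapters):
    """Count the entries of all LoRA factors (the only trained parameters)."""
    return sum(A.size + B.size for A, B in adapters.values())

# Hypothetical toy transformer: 2 layers, attention and MLP projections.
d, d_ff, n_layers = 64, 256, 2
rng = np.random.default_rng(0)
weights = {}
for i in range(n_layers):
    for t in ("q_proj", "k_proj", "v_proj", "o_proj"):
        weights[f"layers.{i}.{t}"] = rng.normal(size=(d, d))
    weights[f"layers.{i}.up_proj"] = rng.normal(size=(d_ff, d))
    weights[f"layers.{i}.down_proj"] = rng.normal(size=(d, d_ff))

# Two placement strategies mentioned in the abstract: attention vs. MLP.
attn = place_adapters(weights, {"q_proj", "k_proj"}, rank=4)
mlp = place_adapters(weights, {"up_proj", "down_proj"}, rank=4)

print("attention placement:", len(attn), "modules,", trainable_params(attn), "params")
print("MLP placement:", len(mlp), "modules,", trainable_params(mlp), "params")
```

At a fixed rank, the two placements train different numbers of parameters and modify different parts of the network, which is one reason the choice of module types is a design decision in its own right.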

Soufiane Hayou, Nikhil Ghosh, Bin Yu • 2025

Related benchmarks

Task | Dataset | Result | Rank
Multi-turn conversation | MT-Bench | Average Score: 69.1 | 107
Code Generation | HumanEval+ | Pass@1: 58.5 | 61
Code Generation | HumanEval | HumanEval Accuracy: 62.8 | 49
General Instruction Tuning | General Instruction Tuning Suite (MMLU, TyDiQA, CQA, TruthfulQA, GSM8K, LogiQA) | MMLU: 70.9 | 8
