
Full Parameter Fine-tuning for Large Language Models with Limited Resources

About

Large Language Models (LLMs) have revolutionized Natural Language Processing (NLP) but demand massive GPU resources for training. Lowering the threshold for LLM training would encourage greater participation from researchers, benefiting both academia and society. While existing approaches have focused on parameter-efficient fine-tuning, which tunes or adds a small number of parameters, few have addressed the challenge of tuning the full parameters of LLMs with limited resources. In this work, we propose a new optimizer, LOw-Memory Optimization (LOMO), which fuses gradient computation and parameter update in one step to reduce memory usage. By integrating LOMO with existing memory-saving techniques, we reduce memory usage to 10.8% of that of the standard approach (the DeepSpeed solution). Consequently, our approach enables full-parameter fine-tuning of a 65B model on a single machine with 8 RTX 3090 GPUs, each with 24GB of memory. Code and data are available at https://github.com/OpenLMLab/LOMO.
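The core idea of fusing the gradient computation with the parameter update can be sketched in a few lines of PyTorch. The snippet below is a simplified illustration, not the actual LOMO implementation: it registers a per-parameter hook that applies a plain SGD step the moment that parameter's gradient becomes available and then discards the gradient, so full gradients for all parameters never need to coexist in memory. The real optimizer additionally handles gradient normalization, clipping, and mixed precision (see the repository linked above); the function name `attach_lomo_hooks` is invented here for illustration.

```python
import torch

def attach_lomo_hooks(model, lr=1e-3):
    """Illustrative sketch of LOMO's fused update: apply an SGD step
    inside each parameter's gradient hook, then return a zero tensor
    so no full-size gradient is retained after the update."""
    def make_hook(p):
        def hook(grad):
            with torch.no_grad():
                p.add_(grad, alpha=-lr)   # in-place SGD step, done immediately
            return torch.zeros_like(grad)  # drop the gradient's contents
        return hook
    for p in model.parameters():
        if p.requires_grad:
            p.register_hook(make_hook(p))

# Toy usage: after backward(), the parameters are already updated,
# and no optimizer.step() (or stored .grad buffers) is needed.
model = torch.nn.Sequential(torch.nn.Linear(8, 8), torch.nn.ReLU(), torch.nn.Linear(8, 1))
attach_lomo_hooks(model, lr=1e-2)
loss = model(torch.randn(4, 8)).pow(2).mean()
loss.backward()  # updates happen inside the hooks during this call
```

Because each hook fires right after the corresponding layer's backward pass has consumed the old weight values, the in-place update is safe within a single backward call; this is what lets the memory cost of gradients stay at a single parameter tensor at a time rather than a full model copy.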

Kai Lv, Yuqing Yang, Tengxiao Liu, Qinghui Gao, Qipeng Guo, Xipeng Qiu• 2023

Related benchmarks

Task | Dataset | Result | Rank
Mathematical Reasoning | GSM8K | Accuracy: 74.6 | 351
Multitask Language Understanding | MMLU | Accuracy: 66.2 | 206
Natural Language Understanding | SuperGLUE | SGLUE Score: 89.9 | 84
Instruction Following | MT-bench v1.0 (test) | MT-Bench Score: 5.69 | 52
Mathematical Reasoning | Math Benchmarks Aggregate | -- | 44
Multi-turn Conversation | MT-Bench | Conversation Rating (1-10): 7.5 | 41
Cross-lingual Capability Evaluation | Multilingual benchmark suite | Multilingual Score: 39.3 | 8
