Parameter-Efficient Fine-Tuning with Discrete Fourier Transform

About

Low-rank adaptation~(LoRA) has recently gained much interest in fine-tuning foundation models. It effectively reduces the number of trainable parameters by incorporating low-rank matrices $A$ and $B$ to represent the weight change, i.e., $\Delta W=BA$. Despite LoRA's progress, it faces storage challenges when handling extensive customization adaptations or larger base models. In this work, we aim to further compress trainable parameters by enjoying the powerful expressiveness of the Fourier transform. Specifically, we introduce FourierFT, which treats $\Delta W$ as a matrix in the spatial domain and learns only a small fraction of its spectral coefficients. With the trained spectral coefficients, we implement the inverse discrete Fourier transform to recover $\Delta W$. Empirically, our FourierFT method shows comparable or better performance with fewer parameters than LoRA on various tasks, including natural language understanding, natural language generation, instruction tuning, and image classification. For example, when performing instruction tuning on the LLaMA2-7B model, FourierFT surpasses LoRA with only 0.064M trainable parameters, compared to LoRA's 33.5M. Our code is released at \url{https://github.com/Chaos96/fourierft}.

Ziqi Gao, Qichao Wang, Aochuan Chen, Zijing Liu, Bingzhe Wu, Liang Chen, Jia Li• 2024

Related benchmarks

Task	Dataset	Result
Image Classification	EuroSAT	Accuracy98.02	569
Natural Language Understanding	GLUE	SST-295.3	551
Classification	Cars	Accuracy39.2	492
Image Classification	RESISC45	Accuracy95.2	472
Image Classification	SUN397	Accuracy61.92	450
Image Classification	StanfordCars	Accuracy79.14	384
Image Classification	CIFAR100	Accuracy93.37	301
Image Classification	OxfordPets	Accuracy94.84	298
Image Classification	CIFAR10	Accuracy (%)99.1	282
Image Classification	VTAB 1K	Overall Mean Accuracy72.8	281

Showing 10 of 33 rows

Other info

Follow for update

@wizwand_team Discord