Offsite-Tuning: Transfer Learning without Full Model

About

Transfer learning is important for foundation models to adapt to downstream tasks. However, many foundation models are proprietary, so users must share their data with model owners to fine-tune the models, which is costly and raises privacy concerns. Moreover, fine-tuning large foundation models is computation-intensive and impractical for most downstream users. In this paper, we propose Offsite-Tuning, a privacy-preserving and efficient transfer learning framework that can adapt billion-parameter foundation models to downstream data without access to the full model. In offsite-tuning, the model owner sends a lightweight adapter and a lossy compressed emulator to the data owner, who then fine-tunes the adapter on the downstream data with the emulator's assistance. The fine-tuned adapter is then returned to the model owner, who plugs it into the full model to create an adapted foundation model. Offsite-tuning preserves both parties' privacy and is computationally more efficient than existing fine-tuning methods that require access to the full model weights. We demonstrate the effectiveness of offsite-tuning on various large language and vision foundation models. Offsite-tuning achieves accuracy comparable to full-model fine-tuning while being privacy-preserving and efficient, with a 6.5x speedup and a 5.6x memory reduction. Code is available at https://github.com/mit-han-lab/offsite-tuning.

Guangxuan Xiao, Ji Lin, Song Han • 2023
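
The three-step exchange described in the abstract (the model owner ships an adapter plus an emulator, the data owner tunes the adapter, the model owner plugs it back in) can be illustrated with a toy sketch. Everything below is hypothetical and not the authors' code: the scalar `Layer` class, the function names, the choice of the first and last layers as the adapter, layer dropping as the lossy compression, and the finite-difference optimizer are all assumptions made to keep the example self-contained.

```python
import copy
from dataclasses import dataclass


@dataclass
class Layer:
    """Toy scalar layer y = w * x + b, standing in for one model block."""
    w: float
    b: float

    def __call__(self, x: float) -> float:
        return self.w * x + self.b


def forward(layers, x):
    for layer in layers:
        x = layer(x)
    return x


def mse(layers, data):
    return sum((forward(layers, x) - y) ** 2 for x, y in data) / len(data)


def make_offsite_package(model, n_adapter=1, keep_every=2):
    """Model owner: copy out the adapter layers and build a lossy emulator."""
    bottom = copy.deepcopy(model[:n_adapter])
    top = copy.deepcopy(model[-n_adapter:])
    middle = model[n_adapter:-n_adapter]
    # Assumption: the middle is compressed by keeping every `keep_every`-th layer.
    emulator = copy.deepcopy(middle[::keep_every])
    return bottom, emulator, top


def finetune_adapter(bottom, emulator, top, data, steps=200, lr=0.1, eps=1e-5):
    """Data owner: update adapter parameters only; the emulator stays frozen."""
    trainable = bottom + top
    stack = bottom + emulator + top  # shares the same Layer objects
    loss = mse(stack, data)
    for _ in range(steps):
        grads = []
        for layer in trainable:
            for name in ("w", "b"):
                old = getattr(layer, name)
                setattr(layer, name, old + eps)
                up = mse(stack, data)
                setattr(layer, name, old - eps)
                down = mse(stack, data)
                setattr(layer, name, old)
                # Central finite difference in place of backpropagation.
                grads.append((layer, name, old, (up - down) / (2 * eps)))
        for layer, name, old, grad in grads:
            setattr(layer, name, old - lr * grad)
        new_loss = mse(stack, data)
        if new_loss < loss:
            loss = new_loss
        else:
            # Reject a step that does not reduce the loss and shrink the step size.
            for layer, name, old, _ in grads:
                setattr(layer, name, old)
            lr /= 2
    return loss


def plug_in(model, bottom, top):
    """Model owner: wrap the untouched full middle with the returned adapter."""
    return bottom + model[len(bottom):len(model) - len(top)] + top


model = [Layer(1.0, 0.1) for _ in range(6)]            # "full" model, never shared
data = [(x, 2 * x + 1) for x in (-1.0, 0.0, 1.0, 2.0)]  # private downstream data

bottom, emulator, top = make_offsite_package(model)
before = mse(bottom + emulator + top, data)
after = finetune_adapter(bottom, emulator, top, data)
adapted = plug_in(model, bottom, top)
print(f"emulator loss {before:.3f} -> {after:.3f}; "
      f"plugged-in loss {mse(adapted, data):.3f}")
```

In this sketch the data owner never sees the four middle layers of `model`, and the model owner never sees `data`; only the adapter parameters cross the boundary. How well the tuned adapter transfers once plugged into the full model depends on how closely the emulator approximates the real middle layers, which the toy example does not attempt to guarantee.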

Related benchmarks

Task	Dataset	Result
Question Answering	ARC Challenge	--	906
Natural Language Understanding	GLUE	SST-2 96.4	551
Question Answering	OpenBookQA	Accuracy 29	465
Physical Interaction Question Answering	PIQA	Accuracy 74.5	462
Sentence Completion	HellaSwag	Accuracy 43.3	440
Question Answering	ARC Easy	Normalized Acc 59.4	420
Question Answering	OBQA	Accuracy 34.4	347
Question Answering	ARC-C	Accuracy 43.8	283
Science Question Answering	ARC-E	Accuracy 76.5	240
Multiple-choice Question Answering	SciQ	Accuracy 92.9	91

Showing 10 of 35 rows
