GradOT: Training-free Gradient-preserving Offsite-tuning for Large Language Models
About
As large language models (LLMs) grow rapidly, centralized fine-tuning has become a key technique for adapting them to domain-specific tasks, but it poses privacy risks for both model and data owners. Offsite-tuning (OT) is a promising solution to these challenges: a weaker emulator is compressed from the original model and then fine-tuned with an adapter to enhance privacy. However, existing OT-based methods incur high computational costs and lack theoretical analysis. This paper introduces GradOT, a novel OT approach based on gradient-preserving compression. By analyzing the OT problem through the lens of optimization, we propose a method that selectively applies compression techniques such as rank compression and channel pruning, preserving the gradients of the fine-tuned adapters while ensuring privacy. Extensive experiments demonstrate that our approach surpasses existing OT methods in both privacy protection and model performance. Our method provides a theoretical foundation for OT and offers a practical, training-free solution for offsite-tuning of large-scale LLMs.
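To make the compression step concrete, here is a minimal, hypothetical sketch of channel pruning, one of the techniques the abstract names. The magnitude-based criterion (dropping the channels with the smallest L2 norm) is an illustrative assumption, not the paper's actual gradient-preserving selection rule:

```python
import math

def prune_channels(weights, keep):
    """Keep the `keep` channels with the largest L2 norm.

    weights: list of channels, each a list of floats.
    Illustrative magnitude criterion only -- GradOT's actual selection
    is driven by preserving adapter gradients.
    """
    norms = [math.sqrt(sum(x * x for x in ch)) for ch in weights]
    # Indices of the strongest channels, restored to their original order.
    keep_idx = sorted(sorted(range(len(weights)), key=lambda i: -norms[i])[:keep])
    return [weights[i] for i in keep_idx]

W = [[0.1, 0.2], [3.0, 4.0], [0.5, 0.5], [2.0, 1.0]]
print(prune_channels(W, keep=2))  # [[3.0, 4.0], [2.0, 1.0]]
```

The pruned matrix plays the role of the weaker emulator: it approximates the original layer while withholding part of the model owner's weights.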
Related benchmarks
| Task | Dataset | Result | Rank |
|---|---|---|---|
| Question Answering | ARC Challenge | -- | 749 |
| Question Answering | OpenBookQA | Accuracy: 30.8 | 465 |
| Question Answering | ARC Easy | Normalized Acc: 61.3 | 385 |
| Physical Interaction Question Answering | PIQA | Accuracy: 73.6 | 323 |
| Question Answering | OBQA | Accuracy: 36.2 | 276 |
| Question Answering | ARC-C | Accuracy: 50.1 | 166 |
| Science Question Answering | ARC-E | Accuracy: 77.8 | 138 |
| Sentence Completion | HellaSwag | Accuracy: 41.7 | 133 |
| Multiple-choice Question Answering | SciQ | Accuracy: 93.9 | 74 |
| Question Answering | WebQuestions (WebQs) | Accuracy: 48.1 | 67 |