One Algorithm, Two Goals: Dual Scoring for Parameter and Data Selection in LLM Fine-Tuning

About

In Large Language Model (LLM) fine-tuning, parameter and data selection are common strategies for reducing fine-tuning cost, yet they are typically driven by separate scoring mechanisms. When a parameter mask and data subset jointly determine restricted fine-tuning, this separation incurs redundant overhead and makes coordinated selection difficult. We cast parameter and data selection as two bilevel selection problems under a common validation objective and derive a shared local response-surrogate scoring rule. Under first- and second-order validation-improvement approximations, parameter importance and data utility emerge as column-wise and row-wise aggregations of a single gradient interaction matrix, yielding a closed-form row-column correspondence for co-extracting both signals. Building on this structure, we propose DualSFT (Dual-Selection Fine-Tuning), a one-shot dual-scoring algorithm that produces a parameter mask and data subset from shared gradient statistics. On 3B-9B LLMs, single-axis DualSFT variants strengthen target-task performance and stability-plasticity trade-offs within their comparison groups, while full DualSFT yields a more favorable joint-constrained trade-off than sequential hybrid baselines under matched budgets.

Xinrui Chen, Liu Yang, Ou Wu• 2026

Related benchmarks

Task	Dataset	Result
Code-Specific Instruction Tuning Evaluation	Magicoder Evaluation Suite	ARC-C Accuracy54.01	48
Forgetting-aware Instruction Tuning	Magicoder Stability and Plasticity suites (test)	ARC-C54.01	36
Instruction Fine-tuning	MetaMathQA Fine-tuning Evaluation Suite (ARC-C, PIQA, MMLU, HE, GSM8K) (test)	ARC-C Accuracy51.38	32
Instruction Tuning	Magicoder HumanEval	Stability50.54	7

Showing 4 of 4 rows

Other info

Follow for update

@wizwand_team Discord