One Algorithm, Two Goals: Dual Scoring for Parameter and Data Selection in LLM Fine-Tuning
About
In Large Language Model (LLM) fine-tuning, parameter and data selection are common strategies for reducing fine-tuning cost, yet they are typically driven by separate scoring mechanisms. When a parameter mask and data subset jointly determine restricted fine-tuning, this separation incurs redundant overhead and makes coordinated selection difficult. We cast parameter and data selection as two bilevel selection problems under a common validation objective and derive a shared local response-surrogate scoring rule. Under first- and second-order validation-improvement approximations, parameter importance and data utility emerge as column-wise and row-wise aggregations of a single gradient interaction matrix, yielding a closed-form row-column correspondence for co-extracting both signals. Building on this structure, we propose DualSFT (Dual-Selection Fine-Tuning), a one-shot dual-scoring algorithm that produces a parameter mask and data subset from shared gradient statistics. On 3B-9B LLMs, single-axis DualSFT variants strengthen target-task performance and stability-plasticity trade-offs within their comparison groups, while full DualSFT yields a more favorable joint-constrained trade-off than sequential hybrid baselines under matched budgets.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Code-Specific Instruction Tuning Evaluation | Magicoder Evaluation Suite | ARC-C Accuracy54.01 | 48 | |
| Forgetting-aware Instruction Tuning | Magicoder Stability and Plasticity suites (test) | ARC-C54.01 | 36 | |
| Instruction Fine-tuning | MetaMathQA Fine-tuning Evaluation Suite (ARC-C, PIQA, MMLU, HE, GSM8K) (test) | ARC-C Accuracy51.38 | 32 | |
| Instruction Tuning | Magicoder HumanEval | Stability50.54 | 7 |