Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

One Algorithm, Two Goals: Dual Scoring for Parameter and Data Selection in LLM Fine-Tuning

About

In Large Language Model (LLM) fine-tuning, parameter and data selection are common strategies for reducing fine-tuning cost, yet they are typically driven by separate scoring mechanisms. When a parameter mask and data subset jointly determine restricted fine-tuning, this separation incurs redundant overhead and makes coordinated selection difficult. We cast parameter and data selection as two bilevel selection problems under a common validation objective and derive a shared local response-surrogate scoring rule. Under first- and second-order validation-improvement approximations, parameter importance and data utility emerge as column-wise and row-wise aggregations of a single gradient interaction matrix, yielding a closed-form row-column correspondence for co-extracting both signals. Building on this structure, we propose DualSFT (Dual-Selection Fine-Tuning), a one-shot dual-scoring algorithm that produces a parameter mask and data subset from shared gradient statistics. On 3B-9B LLMs, single-axis DualSFT variants strengthen target-task performance and stability-plasticity trade-offs within their comparison groups, while full DualSFT yields a more favorable joint-constrained trade-off than sequential hybrid baselines under matched budgets.

Xinrui Chen, Liu Yang, Ou Wu• 2026

Related benchmarks

TaskDatasetResultRank
Code-Specific Instruction Tuning EvaluationMagicoder Evaluation Suite
ARC-C Accuracy54.01
48
Forgetting-aware Instruction TuningMagicoder Stability and Plasticity suites (test)
ARC-C54.01
36
Instruction Fine-tuningMetaMathQA Fine-tuning Evaluation Suite (ARC-C, PIQA, MMLU, HE, GSM8K) (test)
ARC-C Accuracy51.38
32
Instruction TuningMagicoder HumanEval
Stability50.54
7
Showing 4 of 4 rows

Other info

Follow for update