TRIM: Token-wise Attention-Derived Saliency for Data-Efficient Instruction Tuning
About
Instruction tuning is essential for aligning large language models (LLMs) to downstream tasks and commonly relies on large, diverse corpora. However, small, high-quality subsets, known as coresets, can deliver comparable or superior results, though curating them remains challenging. Existing methods often rely on coarse, sample-level signals like gradients, an approach that is computationally expensive and overlooks fine-grained features. To address this, we introduce TRIM (Token Relevance via Interpretable Multi-layer Attention), a forward-only, token-centric framework. Instead of using gradients, TRIM operates by matching underlying representational patterns identified via attention-based "fingerprints" from a handful of target samples. Such an approach makes TRIM highly efficient and uniquely sensitive to the structural features that define a task. Coresets selected by our method consistently outperform state-of-the-art baselines by up to 9% on downstream tasks and even surpass the performance of full-data fine-tuning in some settings. By avoiding expensive backward passes, TRIM achieves this at a fraction of the computational cost. These findings establish TRIM as a scalable and efficient alternative for building high-quality instruction-tuning datasets.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Reasoning | BBH | Accuracy39.73 | 726 | |
| Mathematical Reasoning | SVAMP (test) | Accuracy65.12 | 293 | |
| Social Commonsense Reasoning | SocialIQA | Accuracy46.26 | 143 | |
| commonsense inference | HellaSwag | Accuracy49.08 | 123 | |
| Mathematical Reasoning | GSM8K (test) | EM Accuracy52.23 | 41 | |
| Mathematical Reasoning | NUMGLUE | -- | 39 | |
| Mathematical Reasoning | SIMULEQ | -- | 30 | |
| Commonsense Reasoning | CommonsenseQA | Accuracy40.76 | 19 | |
| Mathematical Reasoning | MATH | Strict Accuracy31.4 | 13 | |
| Question Answering | TyDiQA | Accuracy56.62 | 11 |