Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Learning to Weight Parameters for Training Data Attribution

About

We study gradient-based data attribution, aiming to identify which training examples most influence a given output. Existing methods for this task either treat network parameters uniformly or rely on implicit weighting derived from Hessian approximations, which do not fully model functional heterogeneity of network parameters. To address this, we propose a method to explicitly learn parameter importance weights directly from data, without requiring annotated labels. Our approach improves attribution accuracy across diverse tasks, including image classification, language modeling, and diffusion, and enables fine-grained attribution for concepts like subject and style.

Shuangqi Li, Hieu Le, Jingyi Xu, Mathieu Salzmann• 2025

Related benchmarks

TaskDatasetResultRank
Error detectionImageNet
AuROC83.58
35
Data AttributionArtBench-2 (test)
LDS31.58
10
Data AttributionNaruto (test)
LDS20.44
10
Data AttributionSB-Pokemon synthetic (test)
LDS7.97
10
Data AttributionCIFAR-2 CIFAR-10 subset (test)
LDS (%)13.79
10
Data AttributionWikiText-103 (test)
Tail-patch Score7.88
9
Data AttributionWikiText-103
LDS18.33
8
Data AttributionImageNet
LDS23.92
8
Fine-grained Data AttributionSB-Pokemon (train)
Recall@10 (Subject)51.6
5
Fine-grained Data AttributionSB-Pokemon (val)
Recall@10 (Subject)50.6
5
Showing 10 of 11 rows

Other info

Follow for update