DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models

About

Quantifying the impact of training data points is crucial for understanding the outputs of machine learning models and for improving the transparency of the AI pipeline. The influence function is a principled and popular data attribution method, but its computational cost often makes it challenging to use. This issue becomes more pronounced in the setting of large language models and text-to-image models. In this work, we propose DataInf, an efficient influence approximation method that is practical for large-scale generative AI models. Leveraging an easy-to-compute closed-form expression, DataInf outperforms existing influence computation algorithms in terms of computational and memory efficiency. Our theoretical analysis shows that DataInf is particularly well-suited for parameter-efficient fine-tuning techniques such as LoRA. Through systematic empirical evaluations, we show that DataInf accurately approximates influence scores and is orders of magnitude faster than existing methods. In applications to RoBERTa-large, Llama-2-13B-chat, and stable-diffusion-v1.5 models, DataInf effectively identifies the most influential fine-tuning examples better than other approximate influence scores. Moreover, it can help to identify which data points are mislabeled.

Yongchan Kwon, Eric Wu, Kevin Wu, James Zou• 2023

Related benchmarks

Task	Dataset	Result
Image Classification	CIFAR-100-N	Accuracy58.4	62
Image Classification	ANIMAL-10N	Accuracy0.816	43
Influential Training Sample Identification	Lego Sets (subset)	Top-5 Accuracy16.47	34
Influential Training Sample Identification	Magic Cards	Top-5 Accuracy96.67	34
Influential Training Sample Identification	Flowers	Top-5 Identification Rate85.56	34
Language Modeling	CNN/Daily Mail (test)	Perplexity16.87	28
Image Classification	CIFAR-10N (test)	Accuracy91.88	19
Defense against adaptive adversarial attacks	Bank	Accuracy87.46	18
Defense against adaptive adversarial attacks	CelebA	Accuracy77.36	18
Defense against adaptive adversarial attacks	JigsawToxicity	Accuracy70.82	18

Showing 10 of 34 rows

Other info

Follow for update

@wizwand_team Discord