PACTran: PAC-Bayesian Metrics for Estimating the Transferability of Pretrained Models to Classification Tasks

About

With the increasing abundance of pretrained models in recent years, the problem of selecting the best pretrained checkpoint for a particular downstream classification task has been gaining increased attention. Although several methods have recently been proposed to tackle the selection problem (e.g. LEEP, H-score), these methods resort to applying heuristics that are not well motivated by learning theory. In this paper we present PACTran, a theoretically grounded family of metrics for pretrained model selection and transferability measurement. We first show how to derive PACTran metrics from the optimal PAC-Bayesian bound under the transfer learning setting. We then empirically evaluate three metric instantiations of PACTran on a number of vision tasks (VTAB) as well as a language-and-vision (OKVQA) task. An analysis of the results shows PACTran is a more consistent and effective transferability measure compared to existing selection methods.

Nan Ding, Xi Chen, Tomer Levinboim, Beer Changpinyo, Radu Soricut• 2022

Related benchmarks

Task	Dataset	Result
Model Selection	DTD	Weighted Kendall's Tau0.614	46
Model Selection	Cars	Weighted Kendall's Tau0.665	36
Model Selection	Pets	Weighted Kendall's Tau0.701	36
Model Selection	CIFAR100	Weighted Kendall's Tau0.742	36
Model Selection	CIFAR10	Weighted Kendall's Tau0.757	36
Model Selection	SUN397	Weighted Kendall's Tau0.638	36
Model Selection	Caltech	Weighted Kendall's Tau0.622	24
PTM Selection	Caltech101	Kendall's weighted tau0.262	19
PTM Selection	Aircraft	Kendall's tau_w0.136	19
Model Selection	Food	Weighted Kendall's Tau0.72	17

Showing 10 of 18 rows

Other info

Code

Follow for update

@wizwand_team Discord