MOKD: Cross-domain Finetuning for Few-shot Classification via Maximizing Optimized Kernel Dependence
About
In cross-domain few-shot classification, the nearest centroid classifier (NCC) aims to learn representations that form a metric space in which few-shot classification can be performed by measuring the similarity between each sample and the prototype of each class. The intuition behind NCC is that each sample is pulled closer to the centroid of its own class and pushed away from the centroids of other classes. However, in this paper, we find that NCC-learned representations of two samples from different classes can still be highly similar. To address this problem, we propose a bi-level optimization framework, maximizing optimized kernel dependence (MOKD), to learn a set of class-specific representations that match the cluster structures indicated by the labeled data of the given task. Specifically, MOKD first optimizes the kernel adopted in the Hilbert-Schmidt independence criterion (HSIC) to obtain the optimized kernel HSIC (opt-HSIC), which can capture the dependence more precisely. Then, an optimization problem regarding the opt-HSIC is solved to simultaneously maximize the dependence between representations and labels and minimize the dependence among all samples. Extensive experiments on Meta-Dataset demonstrate that MOKD not only achieves better generalization performance on unseen domains in most cases but also learns better-clustered data representations. The project repository of MOKD is available at: https://github.com/tmlr-group/MOKD.
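The ideas in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation (see the repository above for that). It shows NCC prediction with cosine similarity, the standard biased empirical HSIC estimator tr(KHLH)/(n-1)^2, and a hypothetical objective that rewards dependence between representations and labels while penalizing dependence among samples. The function names, the fixed Gaussian bandwidth, the linear kernel on one-hot labels, and the weight `gamma` are all assumptions made for illustration; the kernel-optimization step that yields opt-HSIC is not reproduced here.

```python
import numpy as np

def ncc_predict(support, support_labels, query):
    """Assign each query to the class whose centroid is most cosine-similar."""
    classes = np.unique(support_labels)
    # Class prototypes: mean support representation per class.
    centroids = np.stack([support[support_labels == c].mean(axis=0) for c in classes])
    q = query / np.linalg.norm(query, axis=1, keepdims=True)
    c = centroids / np.linalg.norm(centroids, axis=1, keepdims=True)
    return classes[np.argmax(q @ c.T, axis=1)]

def gaussian_kernel(x, bandwidth):
    """Gaussian kernel matrix with a fixed bandwidth (assumed, not optimized)."""
    sq = np.sum(x ** 2, axis=1)
    d2 = np.maximum(sq[:, None] + sq[None, :] - 2.0 * x @ x.T, 0.0)
    return np.exp(-d2 / (2.0 * bandwidth ** 2))

def hsic(K, L):
    """Biased empirical HSIC estimate from two n x n kernel matrices."""
    n = K.shape[0]
    H = np.eye(n) - np.ones((n, n)) / n  # centering matrix
    return np.trace(K @ H @ L @ H) / (n - 1) ** 2

def dependence_objective(Z, labels, bandwidth=1.0, gamma=1.0):
    """Hypothetical objective: HSIC(Z, Y) - gamma * HSIC(Z, Z)."""
    classes = np.unique(labels)
    Y = (labels[:, None] == classes[None, :]).astype(float)  # one-hot labels
    K_z = gaussian_kernel(Z, bandwidth)
    K_y = Y @ Y.T  # linear kernel on one-hot labels (an assumption)
    return hsic(K_z, K_y) - gamma * hsic(K_z, K_z)

# Toy task: two well-separated classes in a 2-D representation space.
support = np.array([[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]])
labels = np.array([0, 0, 1, 1])
query = np.array([[0.8, 0.2], [0.2, 0.8]])

pred = ncc_predict(support, labels, query)

K_z = gaussian_kernel(support, bandwidth=1.0)
one_hot = (labels[:, None] == np.unique(labels)[None, :]).astype(float)
hsic_matched = hsic(K_z, one_hot @ one_hot.T)

# Labels that ignore the cluster structure should show weaker dependence.
bad = np.array([0, 1, 0, 1])
bad_one_hot = (bad[:, None] == np.unique(bad)[None, :]).astype(float)
hsic_mismatched = hsic(K_z, bad_one_hot @ bad_one_hot.T)
```

In this toy task the HSIC between representations and labels is larger when the labels agree with the cluster structure than when they cut across it, which is the kind of dependence the abstract describes maximizing.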
Related benchmarks
| Task | Dataset | Metric | Value | Rank |
|---|---|---|---|---|
| Few-shot classification | Meta-Dataset (test) | Omniglot | 70.9 | 48 |
| Few-shot Image Classification | Meta-Dataset (test) | Omniglot Accuracy | 96 | 40 |
| Few-shot Image Classification | Aircraft (test) | Mean Accuracy | 88.4 | 28 |
| Few-shot classification | Fungi Meta-Dataset (test) | Mean Accuracy | 68.6 | 10 |
| Few-shot classification | VGG Flower Meta-Dataset (test) | Mean Accuracy | 92.5 | 10 |
| Few-shot classification | Traffic Sign Meta-Dataset (test) | Mean Accuracy | 64.5 | 10 |
| Few-shot classification | CIFAR100 (test) | Accuracy | 63.9 | 10 |
| Few-shot classification | Birds Meta-Dataset (test) | Mean Accuracy | 80.4 | 10 |
| Few-shot classification | Textures Meta-Dataset (test) | Mean Accuracy | 76.5 | 10 |
| Few-shot classification | Quick Draw Meta-Dataset (test) | Mean Accuracy | 82.2 | 10 |