MOKD: Cross-domain Finetuning for Few-shot Classification via Maximizing Optimized Kernel Dependence

About

In cross-domain few-shot classification, \emph{nearest centroid classifier} (NCC) aims to learn representations to construct a metric space where few-shot classification can be performed by measuring the similarities between samples and the prototype of each class. An intuition behind NCC is that each sample is pulled closer to the class centroid it belongs to while pushed away from those of other classes. However, in this paper, we find that there exist high similarities between NCC-learned representations of two samples from different classes. In order to address this problem, we propose a bi-level optimization framework, \emph{maximizing optimized kernel dependence} (MOKD) to learn a set of class-specific representations that match the cluster structures indicated by labeled data of the given task. Specifically, MOKD first optimizes the kernel adopted in \emph{Hilbert-Schmidt independence criterion} (HSIC) to obtain the optimized kernel HSIC (opt-HSIC) that can capture the dependence more precisely. Then, an optimization problem regarding the opt-HSIC is addressed to simultaneously maximize the dependence between representations and labels and minimize the dependence among all samples. Extensive experiments on Meta-Dataset demonstrate that MOKD can not only achieve better generalization performance on unseen domains in most cases but also learn better data representation clusters. The project repository of MOKD is available at: \href{https://github.com/tmlr-group/MOKD}{https://github.com/tmlr-group/MOKD}.

Hongduan Tian, Feng Liu, Tongliang Liu, Bo Du, Yiu-ming Cheung, Bo Han• 2024

Related benchmarks

Task	Dataset	Result
Few-shot classification	Meta-Dataset (test)	Omniglot70.9	48
Few-shot Image Classification	Meta-Dataset (test)	Omniglot Accuracy96	40
Few-shot Image Classification	Aircraft (test)	Mean Accuracy88.4	28
Few-shot classification	Fungi Meta-Dataset (test)	Mean Accuracy68.6	10
Few-shot classification	VGG Flower Meta-Dataset (test)	Mean Accuracy92.5	10
Few-shot classification	Traffic Sign Meta-Dataset (test)	Mean Accuracy64.5	10
Few-shot classification	CIFAR100 (test)	Accuracy63.9	10
Few-shot classification	Birds Meta-Dataset (test)	Mean Accuracy80.4	10
Few-shot classification	Textures Meta-Dataset (test)	Mean Accuracy76.5	10
Few-shot classification	Quick Draw Meta-Dataset (test)	Mean Accuracy82.2	10

Showing 10 of 15 rows

Other info

Follow for update

@wizwand_team Discord