MMD-FUSE: Learning and Combining Kernels for Two-Sample Testing Without Data Splitting

About

We propose novel statistics which maximise the power of a two-sample test based on the Maximum Mean Discrepancy (MMD), by adapting over the set of kernels used in defining it. For finite sets, this reduces to combining (normalised) MMD values under each of these kernels via a weighted soft maximum. Exponential concentration bounds are proved for our proposed statistics under the null and alternative. We further show how these kernels can be chosen in a data-dependent but permutation-independent way, in a well-calibrated test, avoiding data splitting. This technique applies more broadly to general permutation-based MMD testing, and includes the use of deep kernels with features learnt using unsupervised models such as auto-encoders. We highlight the applicability of our MMD-FUSE test on both synthetic low-dimensional and real-world high-dimensional data, and compare its performance in terms of power against current state-of-the-art kernel tests.

Felix Biggs, Antonin Schrab, Arthur Gretton• 2023

Related benchmarks

Task	Dataset	Result
Two-sample testing	CIFAR-10 vs CIFAR-10.1 (test)	Power0.937	175
Adversarial Detection	CIFAR-10 (test)	--	160
Two-sample testing	higgs	Test Power100	159
Two-sample testing	CIFAR10-RES18 (test)	Test Power97.5	97
Two-sample testing	Blob	Test Power1	49
Two-sample testing	BLOB (test)	Test Power16.3	49
Two-sample testing	CIFAR10-WRN8	Test Power32.2	49
Two-sample testing	CIFAR10 WRN28	Test Power11.5	49
Adversarial Detection	CIFAR-10	FGSM Acc96.7	25

Showing 9 of 9 rows

Other info

Code

Follow for update

@wizwand_team Discord