Transformed Latent Variable Multi-Output Gaussian Processes

About

Multi-Output Gaussian Processes (MOGPs) provide a principled probabilistic framework for modelling correlated outputs but face scalability bottlenecks when applied to datasets with high-dimensional output spaces. To maintain tractability, existing methods typically resort to restrictive assumptions, such as employing low-rank or sum-of-separable kernels, which can limit expressiveness. We propose the Transformed Latent Variable MOGP (T-LVMOGP), a novel framework that scales MOGPs to a massive number of outputs while preserving the capacity to capture meaningful inter-output dependencies. T-LVMOGP constructs a flexible multi-output deep kernel by mapping inputs and output-specific latent variables into an embedding space using a Lipschitz-regularised neural network. Combined with stochastic variational inference, our model effectively scales to high-dimensional output settings. Across diverse benchmarks, including climate modelling with over 10,000 outputs and zero-inflated spatial transcriptomics data, T-LVMOGP outperforms baselines in both predictive accuracy and computational efficiency.

Xiaoyu Jiang, Xinxing Shi, Sokratia Georgaka, Magnus Rattray, Mauricio A \'Alvarez• 2026

Related benchmarks

Task	Dataset	Result
EEG Prediction	EEG held-out (test)	MSE0.115	7
Multi-Output Gaussian Process Regression	ERA5 block-wise splitting (test)	MSE0.003	7
Spatiotemporal Prediction	ERA5 (random splitting)	MSE0.002	7
Inverse Dynamics Prediction	SARCOS (test)	MSE0.022	5
gene expression prediction	Spatial Transcriptomics dataset	MSE9.189	4
Output extrapolation	Copernicus Marine	MSE0.029	4

Showing 6 of 6 rows

Other info

Follow for update

@wizwand_team Discord