Transformed Latent Variable Multi-Output Gaussian Processes
About
Multi-Output Gaussian Processes (MOGPs) provide a principled probabilistic framework for modelling correlated outputs but face scalability bottlenecks when applied to datasets with high-dimensional output spaces. To maintain tractability, existing methods typically resort to restrictive assumptions, such as employing low-rank or sum-of-separable kernels, which can limit expressiveness. We propose the Transformed Latent Variable MOGP (T-LVMOGP), a novel framework that scales MOGPs to a massive number of outputs while preserving the capacity to capture meaningful inter-output dependencies. T-LVMOGP constructs a flexible multi-output deep kernel by mapping inputs and output-specific latent variables into an embedding space using a Lipschitz-regularised neural network. Combined with stochastic variational inference, our model effectively scales to high-dimensional output settings. Across diverse benchmarks, including climate modelling with over 10,000 outputs and zero-inflated spatial transcriptomics data, T-LVMOGP outperforms baselines in both predictive accuracy and computational efficiency.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| EEG Prediction | EEG held-out (test) | MSE0.115 | 7 | |
| Multi-Output Gaussian Process Regression | ERA5 block-wise splitting (test) | MSE0.003 | 7 | |
| Spatiotemporal Prediction | ERA5 (random splitting) | MSE0.002 | 7 | |
| Inverse Dynamics Prediction | SARCOS (test) | MSE0.022 | 5 | |
| gene expression prediction | Spatial Transcriptomics dataset | MSE9.189 | 4 | |
| Output extrapolation | Copernicus Marine | MSE0.029 | 4 |