DeepRV: Accelerating Spatiotemporal Inference with Pre-trained Neural Priors
About
Gaussian Processes (GPs) provide a flexible and statistically principled foundation for modelling spatiotemporal phenomena, but their $O(N^3)$ scaling makes them intractable for large datasets. Approximate methods such as variational inference (VI), inducing-point (sparse) GPs, low-rank kernel approximations (e.g., Nystrom methods and random Fourier features), and approximations such as INLA improve scalability but typically trade off accuracy, calibration, or modelling flexibility. We introduce DeepRV, a neural-network surrogate that replaces GP prior sampling, while closely matching full GP accuracy at inference including hyperparameter estimates, and reducing computational complexity to $O(N^2)$, increasing scalability and inference speed. DeepRV serves as a drop-in replacement for GP prior realisations in e.g. MCMC-based probabilistic programming pipelines, preserving full model flexibility. Across simulated benchmarks, non-separable spatiotemporal GPs, and a real-world application to education deprivation in London (n = 4,994 locations), DeepRV achieves the highest fidelity to exact GPs while substantially accelerating inference. Code is provided in the dl4bi Python package, with all experiments run on a single consumer-grade GPU to ensure accessibility for practitioners.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Spatiotemporal Inference | Synthetic 2D Grid Matérn-3/2 kernel | ESS/sec18.95 | 35 | |
| Spatiotemporal Inference | Matérn-1/2 simulated 2D grid 1.0 (N=256) | MSE0.002 | 6 | |
| Spatiotemporal Inference | Matérn-1/2 simulated 2D grid 1.0 (N=576) | MSE0.005 | 6 | |
| Spatiotemporal Inference | Matérn-1-2 simulated 2D grid 1.0 (N=1024) | MSE (y_gp vs y_hat)0.009 | 6 | |
| Spatiotemporal Inference | Matérn-1/2 simulated 2D grid 1.0 (N=2304) | MSE0.013 | 6 | |
| Spatiotemporal Inference | Matérn-1/2 simulated 2D grid N=4096 1.0 | MSE0.005 | 6 | |
| Spatiotemporal Inference | non-separable spatiotemporal kernel | Inference Time (s)1.10e+3 | 5 | |
| Arbitrary-location Gaussian Process surrogate modeling | Randomly sampled locations in [0, 100] N = 512, 1024, 2048 | LPD-2 | 3 |