Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Robust Gaussian Processes via Relevance Pursuit

About

Gaussian processes (GPs) are non-parametric probabilistic regression models that are popular due to their flexibility, data efficiency, and well-calibrated uncertainty estimates. However, standard GP models assume homoskedastic Gaussian noise, while many real-world applications are subject to non-Gaussian corruptions. Variants of GPs that are more robust to alternative noise models have been proposed, and entail significant trade-offs between accuracy and robustness, and between computational requirements and theoretical guarantees. In this work, we propose and study a GP model that achieves robustness against sparse outliers by inferring data-point-specific noise levels with a sequential selection procedure maximizing the log marginal likelihood that we refer to as relevance pursuit. We show, surprisingly, that the model can be parameterized such that the associated log marginal likelihood is strongly concave in the data-point-specific noise variances, a property rarely found in either robust regression objectives or GP marginal likelihoods. This in turn implies the weak submodularity of the corresponding subset selection problem, and thereby proves approximation guarantees for the proposed algorithm. We compare the model's performance relative to other approaches on diverse regression and Bayesian optimization tasks, including the challenging but common setting of sparse corruptions of the labels within or close to the function range.

Sebastian Ament, Elizabeth Santorella, David Eriksson, Ben Letham, Maximilian Balandat, Eytan Bakshy• 2024

Related benchmarks

TaskDatasetResultRank
RegressionYacht
RMSE0.827
49
RegressionCA Housing
RMSE0.635
45
RegressionNeal
RMSE0.0379
36
RegressionFriedman 5
RMSE0.0729
36
RegressionFriedman 10
RMSE0.0484
36
RegressionYacht
Negative Log Predictive Density4.6
12
RegressionFriedman 5
Neg Log Pred Density3.93
12
RegressionFriedman 10
Negative Log Predictive Density3.89
12
Predictive Density EstimationNeal Constant noise, 15% Corruptions
Negative Log Likelihood (NLL)-2.5
6
Predictive Density EstimationNeal Student-t noise, 15% Corruptions
Negative Log Predictive Density0.199
6
Showing 10 of 54 rows

Other info

Follow for update