Prediction-Powered Inference with Inverse Probability Weighting

About

Prediction-powered inference (PPI) is a recent framework for valid statistical inference with partially labeled data, combining model-based predictions on a large unlabeled set with bias correction from a smaller labeled subset. Building on existing PPI results under covariate shift, we show that PPI rectification admits a direct design-based interpretation, and that informative labeling can be handled naturally by Horvitz--Thompson and H\'ajek-style corrections. This connection unites design-based survey sampling ideas with modern prediction-assisted inference, yielding estimators that remain valid when labeling probabilities vary across units. We consider the common setting where the inclusion probabilities are not known but estimated from a correctly specified model. In simulations, the performance of IPW-adjusted PPI with estimated propensities closely matches the known-probability case, retaining both nominal coverage and the variance-reduction benefits of PPI.

Jyotishka Datta, Nicholas G. Polson• 2025

Related benchmarks

Task	Dataset	Result	Rank
Mean Estimation	Informative Labeling Simulation Estimated Inclusion Probabilities (200 replicates)	Estimate0.491		5

Showing 1 of 1 rows

Other info

Follow for update

@wizwand_team Discord