Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Generalized Random Forests

About

We propose generalized random forests, a method for non-parametric statistical estimation based on random forests (Breiman, 2001) that can be used to fit any quantity of interest identified as the solution to a set of local moment equations. Following the literature on local maximum likelihood estimation, our method considers a weighted set of nearby training examples; however, instead of using classical kernel weighting functions that are prone to a strong curse of dimensionality, we use an adaptive weighting function derived from a forest designed to express heterogeneity in the specified quantity of interest. We propose a flexible, computationally efficient algorithm for growing generalized random forests, develop a large sample theory for our method showing that our estimates are consistent and asymptotically Gaussian, and provide an estimator for their asymptotic variance that enables valid confidence intervals. We use our approach to develop new methods for three statistical tasks: non-parametric quantile regression, conditional average partial effect estimation, and heterogeneous treatment effect estimation via instrumental variables. A software implementation, grf for R and C++, is available from CRAN.

Susan Athey, Julie Tibshirani, Stefan Wager• 2016

Related benchmarks

TaskDatasetResultRank
Individual Treatment Effect EstimationIHDP (within-sample)
Sqrt PEHE2.9
49
Individual Treatment Effect EstimationIHDP (out-of-sample)--
32
Individual Treatment Effect EstimationOnline gaming product dataset PSM-Matched Ground Truth (out-of-sample)
Epsilon PEHE7.85
16
Individual Treatment Effect EstimationOnline gaming product dataset PSM-Matched Ground Truth (in-sample)
PEHE (Epsilon)7.95
16
Individual Treatment Effect (ITE) EstimationSynthetic
PEHE4.67
16
Individual Treatment Effect (ITE) EstimationSynthetic (out)
PEHE4.73
16
Individual Treatment Effect EstimationOnline gaming product dataset KNN-Matched Ground Truth (out-of-sample)
Epsilon PEHE8.12
16
Individual Treatment Effect EstimationOnline gaming product dataset KNN-Matched Ground Truth in-sample
Epsilon PEHE8.16
16
Individual Treatment Effect (ITE) EstimationNEWS (in)
PEHE1.39
16
Individual Treatment Effect (ITE) EstimationNEWS (out)
PEHE1.4
16
Showing 10 of 14 rows

Other info

Follow for update