Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Adversarial random forests for density estimation and generative modeling

About

We propose methods for density estimation and data synthesis using a novel form of unsupervised random forests. Inspired by generative adversarial networks, we implement a recursive procedure in which trees gradually learn structural properties of the data through alternating rounds of generation and discrimination. The method is provably consistent under minimal assumptions. Unlike classic tree-based alternatives, our approach provides smooth (un)conditional densities and allows for fully synthetic data generation. We achieve comparable or superior performance to state-of-the-art probabilistic circuits and deep learning models on various tabular data benchmarks while executing about two orders of magnitude faster on average. An accompanying $\texttt{R}$ package, $\texttt{arf}$, is available on $\texttt{CRAN}$.

David S. Watson, Kristin Blesch, Jan Kapar, Marvin N. Wright• 2022

Related benchmarks

TaskDatasetResultRank
Tabular Data Synthesis Fidelitybiodeg
KS Statistic (Mean)0.55
90
Tabular Data Synthesis Fidelitysteel
KS Statistic (Mean)0.64
90
Tabular Data Synthesis FidelityPROTEIN
Mean KS Statistic0.74
88
Tabular Data Synthesis Fidelityfourier
KS Fidelity0.75
88
Tabular Data Synthesis FidelityTexture
KS Statistic (Mean)0.9
64
Tabular Data Synthesisfourier
Chi-squared Result0.01
48
Tabular Data Synthesisbiodeg
Chi-Squared Test Result0.05
47
Tabular Data Synthesissteel
Chi-squared Test Result0.14
47
Classificationbiodeg
Balanced Accuracy78.78
45
Classificationsteel
Balanced Accuracy66.33
45
Showing 10 of 34 rows

Other info

Follow for update