Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Estimating the Effects of Continuous-valued Interventions using Generative Adversarial Networks

About

While much attention has been given to the problem of estimating the effect of discrete interventions from observational data, relatively little work has been done in the setting of continuous-valued interventions, such as treatments associated with a dosage parameter. In this paper, we tackle this problem by building on a modification of the generative adversarial networks (GANs) framework. Our model, SCIGAN, is flexible and capable of simultaneously estimating counterfactual outcomes for several different continuous interventions. The key idea is to use a significantly modified GAN model to learn to generate counterfactual outcomes, which can then be used to learn an inference model, using standard supervised methods, capable of estimating these counterfactuals for a new sample. To address the challenges presented by shifting to continuous interventions, we propose a novel architecture for our discriminator - we build a hierarchical discriminator that leverages the structure of the continuous intervention setting. Moreover, we provide theoretical results to support our use of the GAN framework and of the hierarchical discriminator. In the experiments section, we introduce a new semi-synthetic data simulation for use in the continuous intervention setting and demonstrate improvements over the existing benchmark models.

Ioana Bica, James Jordon, Mihaela van der Schaar• 2020

Related benchmarks

TaskDatasetResultRank
Continuous ControlMuJoCo Hopper H=10
Normalized Return12.7
10
Continuous ControlMuJoCo Hopper H=20
Normalized Return29.2
10
Continuous ControlMuJoCo Walker2d (H=10)
Normalized Return8.4
10
Continuous ControlMujoco--
7
Continuous ControlMuJoCo Hopper (H=40)
Normalized Return46.2
5
Continuous ControlMuJoCo HalfCheetah H=10
Normalized Return120
5
Continuous ControlMuJoCo HalfCheetah H=20
Normalized Return-0.3
5
Continuous ControlMuJoCo HalfCheetah H=40
Normalized Return-11.4
5
Policy OptimizationMuJoCo Hopper (H=40)
Return46.2
5
Policy OptimizationMuJoCo HalfCheetah H=10
Return1.2
5
Showing 10 of 26 rows

Other info

Follow for update