Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Simultaneous Missing Value Imputation and Structure Learning with Groups

About

Learning structures between groups of variables from data with missing values is an important task in the real world, yet difficult to solve. One typical scenario is discovering the structure among topics in the education domain to identify learning pathways. Here, the observations are student performances for questions under each topic which contain missing values. However, most existing methods focus on learning structures between a few individual variables from the complete data. In this work, we propose VISL, a novel scalable structure learning approach that can simultaneously infer structures between groups of variables under missing data and perform missing value imputations with deep learning. Particularly, we propose a generative model with a structured latent space and a graph neural network-based architecture, scaling to a large number of variables. Empirically, we conduct extensive experiments on synthetic, semi-synthetic, and real-world education data sets. We show improved performances on both imputation and structure learning accuracy compared to popular and recent approaches.

Pablo Morales-Alvarez, Wenbo Gong, Angus Lamb, Simon Woodhead, Simon Peyton Jones, Nick Pawlowski, Miltiadis Allamanis, Cheng Zhang• 2021

Related benchmarks

TaskDatasetResultRank
Structure learningExtended Synthetic Experiment 24 datasets
Adjacency Recall69.8
9
Causal DiscoverySynthetic Datasets
Adjacency Recall84.3
6
ImputationEedi topics dataset
Accuracy71.47
6
Missing Value ImputationNeuropathic pain (test)
Accuracy94.71
6
Missing Value ImputationSynthetic datasets (test)
RMSE0.1196
6
Structure DiscoveryNeuropathic pain (test)
Adjacency Recall26.1
6
Structure Discovery (Adjacency)Eedi topics dataset
Expert Score3.7
6
Structure Discovery (Orientation)Eedi topics dataset
Avg Expert Score2.76
6
Showing 8 of 8 rows

Other info

Follow for update