Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Graph-based, Self-Supervised Program Repair from Diagnostic Feedback

About

We consider the problem of learning to repair programs from diagnostic feedback (e.g., compiler error messages). Program repair is challenging for two reasons: First, it requires reasoning and tracking symbols across source code and diagnostic feedback. Second, labeled datasets available for program repair are relatively small. In this work, we propose novel solutions to these two challenges. First, we introduce a program-feedback graph, which connects symbols relevant to program repair in source code and diagnostic feedback, and then apply a graph neural network on top to model the reasoning process. Second, we present a self-supervised learning paradigm for program repair that leverages unlabeled programs available online to create a large amount of extra program repair examples, which we use to pre-train our models. We evaluate our proposed approach on two applications: correcting introductory programming assignments (DeepFix dataset) and correcting the outputs of program synthesis (SPoC dataset). Our final system, DrRepair, significantly outperforms prior work, achieving 68.2% full repair rate on DeepFix (+22.9% over the prior best), and 48.4% synthesis success rate on SPoC (+3.7% over the prior best).

Michihiro Yasunaga, Percy Liang• 2020

Related benchmarks

TaskDatasetResultRank
Program synthesisSPoC (TestP)
Success Rate0.385
10
Program synthesisSPoC (TestW)
Success Rate57
10
Program CorrectionDeepFix (test)
Completely Fixed Programs68.2
9
Program RepairDeepFix (test)
Accuracy66.1
9
Automated Program RepairReal-world C programs All errors (test)
Compilation Success Rate34
6
Single-step Error LocalizationDeepFix synthetic (dev)
Accuracy98.9
4
Single-step Error RepairDeepFix synthetic (dev)
Accuracy80.2
4
Showing 7 of 7 rows

Other info

Code

Follow for update