Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

CodeXGLUE: A Machine Learning Benchmark Dataset for Code Understanding and Generation

About

Benchmark datasets have a significant impact on accelerating research in programming language tasks. In this paper, we introduce CodeXGLUE, a benchmark dataset to foster machine learning research for program understanding and generation. CodeXGLUE includes a collection of 10 tasks across 14 datasets and a platform for model evaluation and comparison. CodeXGLUE also features three baseline systems, including the BERT-style, GPT-style, and Encoder-Decoder models, to make it easy for researchers to use the platform. The availability of such data and baselines can help the development and validation of new methods that can be applied to various program understanding and generation problems.

Shuai Lu, Daya Guo, Shuo Ren, Junjie Huang, Alexey Svyatkovskiy, Ambrosio Blanco, Colin Clement, Dawn Drain, Daxin Jiang, Duyu Tang, Ge Li, Lidong Zhou, Linjun Shou, Long Zhou, Michele Tufano, Ming Gong, Ming Zhou, Nan Duan, Neel Sundaresan, Shao Kun Deng, Shengyu Fu, Shujie Liu• 2021

Related benchmarks

TaskDatasetResultRank
Vulnerability DetectionReveal (test)
Precision44.6
42
Vulnerability DetectionCodeXGLUE (test)
Accuracy62.1
18
Clone DetectionPOJ-104 CodeXGLUE (test)
MAP@R82.67
17
Code GenerationConcode CodeXGLUE (test)
EM20.1
14
Vulnerability DetectionREVEAL Chromium Linux Debian Kernel (test)
Precision0.446
12
Docstring GenerationCodeXGLUE Python (test)
BLEU19.06
11
Clone DetectionBigCloneBench CodeXGLUE (test)
F1 Score94.1
11
Code RefinementCodeXGLUE Code Refinement Small v1.0 (test)
BLEU-478.06
10
Code RefinementCodeXGLUE Code Refinement (Medium) v1.0 (test)
BLEU-490.91
10
Text-to-Code generationCodeXGLUE (test)
EM20.1
9
Showing 10 of 19 rows

Other info

Code

Follow for update