
GLUE

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Natural Language Understanding | GLUE (dev) | SST-2 (Acc): 97.36 | 504 |
| Natural Language Understanding | GLUE | SST-2: 156 | 452 |
| Natural Language Understanding | GLUE (test) | SST-2 Accuracy: 97.9 | 416 |
| Natural Language Understanding | GLUE (val) | SST-2: 97.4 | 170 |
| Natural Language Understanding | GLUE (test dev) | MRPC Accuracy: 93.45 | 81 |
| General Language Understanding | GLUE | Accuracy: 92.5 | 66 |
| Natural Language Understanding | GLUE (test val) | MRPC Accuracy: 94 | 59 |
| Natural Language Understanding | GLUE | COLA Score: 69.3 | 41 |
| General Language Understanding | GLUE v1 (test dev) | MNLI: 87.86 | 40 |
| Natural Language Understanding | GLUE (test) | SST-2 Accuracy: 95.64 | 33 |
| Adversarial Attack | GLUE | SST-2 Speedup: 3.56 | 32 |
| Natural Language Understanding | GLUE | SST-2 Speedup: 3.06 | 32 |
| Natural Language Understanding | GLUE v1 (dev) | MRPC Score: 93.8 | 30 |
| Natural Language Understanding | GLUE | SST-2 Acc: 97.5 | 28 |
| Natural Language Understanding | GLUE (test) | QNLI Score: 92.31 | 26 |
| Natural Language Understanding | GLUE (test) | MNLI-mm: 98.6 | 26 |
| Binary classification | GLUE (test) | QNLI Accuracy: 90.66 | 25 |
| Natural Language Understanding | GLUE 1.0 (test) | CoLA (MCC): 66.4 | 25 |
| Natural Language Understanding | GLUE RoBERTa LARGE (test dev) | MNLI Accuracy: 90.57 | 22 |
| Natural Language Inference | GLUE (test) | MNLI Acc: 93.15 | 18 |
| Natural Language Understanding | GLUE official (val test) | SST-2 Accuracy: 0.97 | 18 |
| Natural Language Understanding | GLUE RoBERTa-base (val) | CoLA Score: 60.18 | 16 |
| Natural Language Understanding | GLUE | CoLA: 76.98 | 16 |
| Natural Language Understanding | GLUE | CoLA: 56.7 | 16 |
| Natural Language Understanding | GLUE | CoLA Score: 63.1 | 15 |
Showing 25 of 92 rows