Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

GLUE

Benchmarks

Task NameDataset NameSOTA ResultTrend
Natural Language UnderstandingGLUE
SST-2156
551
Natural Language UnderstandingGLUE (dev)
SST-2 (Acc)97.36
529
Natural Language UnderstandingGLUE (test)
SST-2 Accuracy97.9
416
Natural Language UnderstandingGLUE (val)
SST-297.4
201
Natural Language UnderstandingGLUE (test dev)
MRPC Accuracy93.45
90
General Language UnderstandingGLUE
Accuracy92.5
75
Natural Language UnderstandingGLUE (test)
QNLI7,564.6
64
Model MergingGLUE CoLA, MRPC, RTE, SST-2
Absolute Accuracy75.9
60
Natural Language UnderstandingGLUE (test val)
MRPC Accuracy94
59
Natural Language UnderstandingGLUE
SST-295.18
55
Natural Language UnderstandingGLUE (test)
QNLI94.9
47
Natural Language UnderstandingGLUE
COLA Score69.3
41
Natural Language UnderstandingGLUE
SST-296
40
General Language UnderstandingGLUE v1 (test dev)
MNLI87.86
40
Text ClassificationGLUE
Accuracy (SST-2)89.1
33
Natural Language UnderstandingGLUE (test)
SST-2 Accuracy95.64
33
Text ClassificationGLUE
Accuracy94.1
32
Natural Language UnderstandingGLUE small
CoLA Mcc73.4
32
Adversarial AttackGLUE
SST-2 Speedup3.56
32
Natural Language UnderstandingGLUE
SST-2 Speedup3.06
32
Natural Language UnderstandingGLUE (test)
MNLI Score82.2
30
Natural Language UnderstandingGLUE
Average GLUE Score100
30
Natural Language UnderstandingGLUE
MRPC Score91.91
30
Natural Language UnderstandingGLUE v1 (dev)
MRPC Score93.8
30
Natural Language UnderstandingGLUE
SST-2 Acc97.5
28
Showing 25 of 188 rows
...