GLUE

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Natural Language Understanding | GLUE | SST-2: 156 | 531 |
| Natural Language Understanding | GLUE (dev) | SST-2 (Acc): 97.36 | 518 |
| Natural Language Understanding | GLUE (test) | SST-2 Accuracy: 97.9 | 416 |
| Natural Language Understanding | GLUE (val) | SST-2: 97.4 | 191 |
| Natural Language Understanding | GLUE (test dev) | MRPC Accuracy: 93.45 | 87 |
| General Language Understanding | GLUE | Accuracy: 92.5 | 66 |
| Model Merging | GLUE CoLA, MRPC, RTE, SST-2 | Absolute Accuracy: 75.9 | 60 |
| Natural Language Understanding | GLUE (test val) | MRPC Accuracy: 94 | 59 |
| Natural Language Understanding | GLUE | SST-2: 95.18 | 55 |
| Natural Language Understanding | GLUE | COLA Score: 69.3 | 41 |
| General Language Understanding | GLUE v1 (test dev) | MNLI: 87.86 | 40 |
| Natural Language Understanding | GLUE (test) | MNLI-mm: 98.6 | 39 |
| Natural Language Understanding | GLUE (test) | SST-2 Accuracy: 95.64 | 33 |
| Natural Language Understanding | GLUE small | CoLA Mcc: 73.4 | 32 |
| Adversarial Attack | GLUE | SST-2 Speedup: 3.56 | 32 |
| Natural Language Understanding | GLUE | SST-2 Speedup: 3.06 | 32 |
| Natural Language Understanding | GLUE v1 (dev) | MRPC Score: 93.8 | 30 |
| Natural Language Understanding | GLUE | SST-2 Acc: 97.5 | 28 |
| Text classification | GLUE | Average Score: 87.6 | 28 |
| Natural Language Understanding | GLUE 1.0 (test) | SST-2 (Acc): 97.8 | 28 |
| Natural Language Understanding | GLUE (test) | QNLI Score: 92.31 | 26 |
| Binary classification | GLUE (test) | QNLI Accuracy: 90.66 | 25 |
| Natural Language Understanding | GLUE | MNLI Accuracy: 61.58 | 24 |
| Natural Language Understanding | GLUE MNLI MRPC QNLI QQP SST2 standard (test) | MNLI Accuracy: 88.08 | 24 |
| Natural Language Understanding | GLUE (test) | QNLI: 7,564.6 | 23 |
Showing 25 of 136 rows