Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

GLUE and SuperGLUE

Benchmarks

Task NameDataset NameSOTA ResultTrend
Natural Language UnderstandingGLUE and SuperGLUE (test val)
SST-295.7
37
Sequence ClassificationGLUE & SuperGLUE (MultiRC, COPA, RTE, BoolQ, MRPC, CoLA)
MultiRC Accuracy86.11
24
Natural Language UnderstandingGLUE & SuperGLUE (test)
RTE Accuracy83
19
Natural Language UnderstandingGLUE and SuperGLUE (dev)
BoolQ89.5
12
Natural Language UnderstandingGLUE & SuperGLUE (test dev)
STS-B Score89.8
9
Natural Language UnderstandingGLUE and SuperGLUE (val)
SST-2 Score91.6
6
Showing 6 of 6 rows