Few-shot Language Understanding on SST-2, SST-5, SNLI, MNLI, RTE, TREC (k=16)
[Chart: Accuracy (SST-2) over time. Leading method: FO, 91.8 (May 1, 2026). Available metrics: Accuracy on SST-2, SST-5, SNLI, MNLI, RTE, and TREC, plus Average Accuracy.]
Updated 1mo ago
Evaluation Results
| Method | Configuration | Date (link) | Accuracy (SST-2) | Accuracy (SST-5) | Accuracy (SNLI) | Accuracy (MNLI) | Accuracy (RTE) | Accuracy (TREC) | Average Accuracy |
|---|---|---|---|---|---|---|---|---|---|
| FO | Backbone=RoBERTa-large... | 2026.05 | 91.8 | 47.5 | 77.5 | 70 | 66.4 | 85 | 73 |
| AdaMeZO | Backbone=RoBERTa-large... | 2026.05 | 90.9 | 45.2 | 66.8 | 58.6 | 63.1 | 71.5 | 66 |
| MeZO | Backbone=RoBERTa-large... | 2026.05 | 90.6 | 44.1 | 67.3 | 58.1 | 61.6 | 67.3 | 64.8 |
| MeZO-switch | Backbone=RoBERTa-large... | 2026.05 | 90.6 | 44.3 | 67.3 | 58 | 61.6 | 67 | 64.8 |
| Zero-shot | Backbone=RoBERTa-large... | 2026.05 | 79 | 35.5 | 50.2 | 48.8 | 51.4 | 32 | 49.4 |
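The page does not say how Average Accuracy is computed. The sketch below assumes it is the unweighted mean of the six per-task accuracies and checks that assumption against the reported values. The names `results`, `mean_accuracy`, and `check`, and the 0.1-point tolerance, are illustrative choices and not part of the benchmark.

```python
# Per-task accuracies copied from the results above, in the order
# SST-2, SST-5, SNLI, MNLI, RTE, TREC, paired with the reported average.
results = {
    "FO":          ([91.8, 47.5, 77.5, 70.0, 66.4, 85.0], 73.0),
    "AdaMeZO":     ([90.9, 45.2, 66.8, 58.6, 63.1, 71.5], 66.0),
    "MeZO":        ([90.6, 44.1, 67.3, 58.1, 61.6, 67.3], 64.8),
    "MeZO-switch": ([90.6, 44.3, 67.3, 58.0, 61.6, 67.0], 64.8),
    "Zero-shot":   ([79.0, 35.5, 50.2, 48.8, 51.4, 32.0], 49.4),
}


def mean_accuracy(scores):
    """Unweighted mean over tasks (an assumption about how the average is defined)."""
    return sum(scores) / len(scores)


def check(results, tolerance=0.1):
    """Return {method: (computed mean, reported average, within tolerance?)}."""
    report = {}
    for method, (scores, reported) in results.items():
        computed = mean_accuracy(scores)
        report[method] = (round(computed, 2), reported,
                          abs(computed - reported) <= tolerance)
    return report


for method, (computed, reported, ok) in check(results).items():
    print(f"{method:12s} computed={computed:6.2f} reported={reported:5.1f} match={ok}")
```

Under this assumption every row agrees with the reported average to within 0.1 points. The Zero-shot row is the loosest fit (a computed mean of about 49.48 against a reported 49.4), which may reflect rounding of the per-task scores before display.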