Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Natural Language Understanding on GLUE (Full Suite Reporting)
Loading...
31.5
CoLA Score
LLaMA
13.092
17.871
22.65
27.429
Feb 12, 2026
CoLA Score
SST-2 Accuracy
MRPC Score
STS-B Score
RTE Score
WNLI Score
QQP Score
QNLI Score
MNLI Score
MNLI Mismatched Score
GLUE Average Score
Updated 4d ago
Evaluation Results
Method
Method
Links
CoLA Score
SST-2 Accuracy
MRPC Score
STS-B Score
RTE Score
WNLI Score
QQP Score
QNLI Score
MNLI Score
MNLI Mismatched Score
GLUE Average Score
LLaMA
Evaluation=fine-tuning...
2026.02
31.5
90.8
82.7
78.3
57.8
65.1
68
86
79.8
79.6
71.6
Mamba
Evaluation=fine-tuning...
2026.02
31.1
88.6
80.3
72.8
54.4
65.1
64.8
82.4
74.7
74.7
68.6
ProtoT
Evaluation=fine-tuning...
2026.02
27.7
90
80.1
66.2
53.9
64.6
64.8
81.8
75.3
74.8
67.6
DeltaNet
Evaluation=fine-tuning...
2026.02
13.8
85.8
80.1
67
50.9
65.1
62.6
80.1
71.1
71.8
64.5
Feedback
Search any
task
Search any
task