Natural Language Understanding on CoLA
[Chart: Exact Match over time. Top result: AdapShot, 77.7 (May 5, 2026).]
Updated 28d ago
Evaluation Results
Method            Backbone         Date     Exact Match
AdapShot          LLaMA-3.2 (3B)   2026.05  77.7
AdapShot          Qwen2.5-7B       2026.05  69.9
DBSA              Qwen2.5-7B       2026.05  65.7
DBSA              LLaMA-3.2 (3B)   2026.05  64.9
Many-shot (1024)  Qwen2.5-7B       2026.05  55
Many-shot (512)   Qwen2.5-7B       2026.05  54.2
Many-shot (512)   LLaMA-3.2 (3B)   2026.05  54
Many-shot (1024)  LLaMA-3.2 (3B)   2026.05  53.3
Many-shot (256)   LLaMA-3.2 (3B)   2026.05  52.8
Many-shot (256)   Qwen2.5-7B       2026.05  48.8
Few-shot (8)      Qwen2.5-7B       2026.05  48.4
Few-shot (8)      LLaMA-3.2 (3B)   2026.05  44.9
Zero-shot         LLaMA-3.2 (3B)   2026.05  42.1
Zero-shot         Qwen2.5-7B       2026.05  16.9