Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Natural Language Understanding on PIQA (Exact Match)
Loading...
59.9
Exact Match
AdapShot
8.732
22.016
35.3
48.584
May 5, 2026
Exact Match
Updated 28d ago
Evaluation Results
Method
Method
Links
Exact Match
AdapShot
Backbone=Qwen2.5-7B
2026.05
59.9
Many-shot (256)
Backbone=Qwen2.5-7B
2026.05
56
Many-shot (512)
Backbone=Qwen2.5-7B
2026.05
52.2
Many-shot (512)
Backbone=LLaMA-3.2 (3B)
2026.05
47.3
AdapShot
Backbone=LLaMA-3.2 (3B)
2026.05
46.9
DBSA
Backbone=Qwen2.5-7B
2026.05
40.4
DBSA
Backbone=LLaMA-3.2 (3B)
2026.05
36.8
Many-shot (256)
Backbone=LLaMA-3.2 (3B)
2026.05
34.5
Few-shot (8)
Backbone=Qwen2.5-7B
2026.05
31.8
Zero-shot
Backbone=Qwen2.5-7B
2026.05
29.8
Few-shot (8)
Backbone=LLaMA-3.2 (3B)
2026.05
19.3
Zero-shot
Backbone=LLaMA-3.2 (3B)
2026.05
10.7
Feedback
Search any
task
Search any
task