Zero-shot Language Modeling on LLaMA-3-1B
[Chart: Perplexity (PPL) by method, with the FP Model baseline at 9.6; data point dated May 17, 2026. Metrics shown: Perplexity (PPL), Accuracy (Zero-shot).]
Updated 15d ago
Evaluation Results

Method     Configuration                  Date      Perplexity (PPL)   Accuracy (Zero-shot)
FP Model   Backbone=LLaMA-3-1B, W...      2026.05   9.6                58.5
WINQ       Backbone=LLaMA-3-1B, W...      2026.05   16.9               49.3
QuEST      Backbone=LLaMA-3-1B, W...      2026.05   17.4               48.6
WINQ       Backbone=LLaMA-3-1B, W...      2026.05   42.3               43
QuEST      Backbone=LLaMA-3-1B, W...      2026.05   42.9               42.4