Chat on AlpacaEval Length Controlled (test)
[Chart: AlpLC Score. G-Zero: 27.86 (May 11, 2026)]
Updated 21d ago
Evaluation Results
Method       Configuration                Date      AlpLC Score
G-Zero       Backbone=Llama-3.1-8B-...    2026.05   27.86
base model   Backbone=Llama-3.1-8B-...    2026.05   24.12
G-Zero       Backbone=Llama-3.1-8B-...    2026.05   23.88
R-Zero       Backbone=Llama-3.1-8B-...    2026.05   21.74
G-Zero       Backbone=Qwen3-8B-Base...    2026.05   9.07
base model   Backbone=Qwen3-8B-Base...    2026.05   8.94
G-Zero       Backbone=Qwen3-8B-Base...    2026.05   8.47
R-Zero       Backbone=Qwen3-8B-Base...    2026.05   8.04