LLM Evaluation on AlpacaEval 2.0
[Chart: LC Win Rate over time, Dec 10, 2024 to Mar 8, 2026. Top result: SpecEM, 51.32 LC Win Rate. Selectable metrics: LC Win Rate, Average Score, Win Rate.]
Updated 1mo ago
Evaluation Results
| Method | Method details | Date | LC Win Rate | Average Score | Win Rate |
|---|---|---|---|---|---|
| SpecEM | Model type=Methods of... | 2024.12 | 51.32 | 54.52 | - |
| UniTE | Model type=Methods of... | 2024.12 | 49.2 | 41.04 | - |
| GenFuse | Model type=Methods of... | 2024.12 | 49.06 | 50.84 | - |
| Mistral-24b-instruct-2501 | Model type=Base LLMs | 2024.12 | 48.46 | 44.27 | - |
| MOA | Model type=Methods of... | 2024.12 | 46.98 | 51.24 | - |
| Qwen2.5-32b-instruct | Model type=Base LLMs | 2024.12 | 43.82 | 43.54 | - |
| Qwen2-72b-instruct | Model type=Base LLMs | 2024.12 | 38.1 | - | - |
| Llama3-70b-instruct | Model type=Base LLMs | 2024.12 | 34.4 | 29.39 | - |
| Ours | Backbone=LLaMA-7B, Sel... | 2026.03 | 7.7 | - | 2 |
| Full Dataset | Backbone=LLaMA-7B, Sel... | 2026.03 | 6.7 | - | 1.9 |