Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Instruction Following on AlpacaEval LC 2.0
Loading...
62.8
AlpacaEval 2.0 LC Score
Llama-3.1-Nemotron-70B-Instruct
33.888
41.394
48.9
56.406
Mar 6, 2025
AlpacaEval 2.0 LC Score
Standard Error (SE)
Updated 1mo ago
Evaluation Results
Method
Method
Links
AlpacaEval 2.0 LC Score
Standard Error (SE)
Llama-3.1-Nemotron-70B-Instruct
Feedback + Edit Protoc...
2025.03
62.8
1.3
Llama-3.1-Nemotron-70B-Instruct
Feedback + Edit Protoc...
2025.03
57.6
1.65
GPT-4o-2024-05-13
2025.03
57.5
1.47
Claude-3-5-Sonnet-20240620
2025.03
52.4
1.47
Llama-3.1-405B-Instruct
2025.03
39.3
1.43
Llama-3.1-70B-Instruct
2025.03
38.1
0.9
Llama-3.3-70B-Instruct
Feedback + Edit Protoc...
2025.03
36.9
1.5
Llama-3.3-70B-Instruct
Feedback + Edit Protoc...
2025.03
35
1.45
Feedback
Search any
task
Search any
task