Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Instruction Following on AlpacaEval 805 instructions (test)
Loading...
79.91
Win Rate
LoRA (upper bound)
1.1924
21.6287
42.065
62.5013
Oct 23, 2024
Win Rate
Updated 4d ago
Evaluation Results
Method
Method
Links
Win Rate
LoRA (upper bound)
Base LLM=MISTRAL-7B, P...
2024.10
79.91
LoRA (upper bound)
Base LLM=LLAMA2-13B, P...
2024.10
75.16
LoRA (upper bound)
Base LLM=LLAMA2-7B, Pa...
2024.10
68.18
CMC
Base LLM=LLAMA2-70B, P...
2024.10
49.81
CMC
Base LLM=LLAMA2-13B, P...
2024.10
39.04
CMC
Base LLM=MISTRAL-7B, P...
2024.10
33.29
CMC
Base LLM=LLAMA2-7B, Pa...
2024.10
30.41
Vanilla Base Model
Base LLM=LLAMA2-70B, P...
2024.10
11.55
Proxy-tuning
Base LLM=LLAMA2-13B, P...
2024.10
10.47
Proxy-tuning
Base LLM=LLAMA2-70B, P...
2024.10
8.59
Proxy-tuning
Base LLM=LLAMA2-7B, Pa...
2024.10
8.47
Vanilla Base Model
Base LLM=MISTRAL-7B, P...
2024.10
6.83
Vanilla Base Model
Base LLM=LLAMA2-13B, P...
2024.10
5.34
Vanilla Base Model
Base LLM=LLAMA2-7B, Pa...
2024.10
4.22
Feedback
Search any
task
Search any
task