LLM Alignment on AlpacaEval
[Chart: AdaBoN, Percent Batches (BWR > 0.50), May 17, 2025]
Updated 1mo ago
Evaluation Results
Method  Configuration              Date     Percent Batches (BWR > 0.50)
AdaBoN  LM=Qwen, RM=Mistral, K...  2025.05  100
AdaBoN  LM=Qwen, RM=FsfairX, K...  2025.05  98
AdaBoN  LM=Mistral, RM=Armo, K...  2025.05  96
AdaBoN  LM=Llama, RM=FsfairX,...   2025.05  96
AdaBoN  LM=Mistral, RM=Mistral...  2025.05  94
AdaBoN  LM=Mistral, RM=FsfairX...  2025.05  92
AdaBoN  LM=Gemma, RM=FsfairX,...   2025.05  92
AdaBoN  LM=Llama, RM=Mistral,...   2025.05  92
AdaBoN  LM=Llama, RM=Armo, K=5...  2025.05  92
AdaBoN  LM=Gemma, RM=Armo, K=5...  2025.05  86
AdaBoN  LM=Qwen, RM=Armo, K=5,...  2025.05  78
AdaBoN  LM=Gemma, RM=Mistral,...   2025.05  76