Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Best-of-N Alignment on AlpacaEval (test)
Loading...
62
BWR
AdaBoN
53.68
55.84
58
60.16
May 17, 2025
BWR
Updated 1mo ago
Evaluation Results
Method
Method
Links
BWR
AdaBoN
LM=Qwen, RM=FsfairX, K...
2025.05
62
AdaBoN
LM=Qwen, RM=Mistral, K...
2025.05
60
AdaBoN
LM=Mistral, RM=Armo, K...
2025.05
59
AdaBoN
LM=Llama, RM=FsfairX,...
2025.05
59
AdaBoN
LM=Llama, RM=Armo, K=5...
2025.05
59
AdaBoN
LM=Mistral, RM=Mistral...
2025.05
58
AdaBoN
LM=Mistral, RM=FsfairX...
2025.05
58
AdaBoN
LM=Llama, RM=Mistral,...
2025.05
58
AdaBoN
LM=Gemma, RM=Mistral,...
2025.05
56
AdaBoN
LM=Gemma, RM=Armo, K=5...
2025.05
56
AdaBoN
LM=Gemma, RM=FsfairX,...
2025.05
55
AdaBoN
LM=Qwen, RM=Armo, K=5,...
2025.05
54
Feedback
Search any
task
Search any
task