Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Instruction Following on MT-Bench GPT-4o judge (test)
Loading...
7.76
MT-Bench Score
ScaleBiO
5.212
5.8735
6.535
7.1965
Jun 28, 2024
MT-Bench Score
Updated 4d ago
Evaluation Results
Method
Method
Links
MT-Bench Score
ScaleBiO
Model=Qwen-2-7B
2024.06
7.76
ScaleBiO
Model=Gemma-2-9B
2024.06
7.51
RHO-LOSS
Model=Gemma-2-9B
2024.06
7.38
RHO-LOSS
Model=Qwen-2-7B
2024.06
7.34
LESS
Model=Gemma-2-9B
2024.06
7.2
LESS
Model=Qwen-2-7B
2024.06
7.18
ScaleBiO
Model=Llama-3-8B
2024.06
7.12
RHO-LOSS
Model=Llama-3-8B
2024.06
6.89
Uniform Weighting
Model=Qwen-2-7B
2024.06
6.66
Uniform Weighting
Model=Llama-3-8B
2024.06
6.11
LESS
Model=Llama-3-8B
2024.06
6.06
Uniform Weighting
Model=Gemma-2-9B
2024.06
5.31
Feedback
Search any
task
Search any
task