Share your thoughts, 1 month free Claude Pro on usSee more

Instruction Following on MT-Bench short-context

8.13MT-Bench Score

officially post-trained

Updated 3mo ago

Evaluation Results

Method	Links
officially post-trained 2024.10		8.13
officially post-trained 2024.10		8.09
DPO w/ SRM 2024.10		7.58
DPO w/ Contrast 2024.10		7.58
DPO w/ LongReward 2024.10		7.58
DPO w/ Contrast 2024.10		7.54
DPO w/ SRM 2024.10		7.5
SFT 2024.10		7.37
DPO w/ LongReward 2024.10		7.24
SFT 2024.10		7.12