Share your thoughts, 1 month free Claude Pro on usSee more

Dialogue on Dialogue (test)

8.84Fluency

SFT

Updated 4mo ago

Evaluation Results

Method	Links
SFT 2024.06		8.84	7.77	8.49	7.43	7.31
HBAT 2024.06		8.8	7.7	8.48	7.39	7.89
HBAT 2024.06		8.79	7.8	8.45	7.51	7.96
DPO 2024.06		8.67	7.13	8.54	7.63	7.84
RLHF 2024.06		8.39	7.62	8.87	7.47	7.72