Helpful response generation on Human A/B 100 randomly chosen instances (test)

62Human Preference Score

MuffinGPT-3.5

Updated 5mo ago

Evaluation Results

Method	Links
MuffinGPT-3.5 2024.01		62
GPT-3.5 2024.01		8