Helpful response generation on Human A/B 100 randomly chosen instances (test)
[Chart: Human Preference Score by date. Top result: MuffinGPT-3.5, 62 (Jan 11, 2024).]
Evaluation Results

Method          Details                     Date      Human Preference Score
MuffinGPT-3.5   base_model=GPT-3.5          2024.01   62
GPT-3.5         Setting=in-context lea...   2024.01   8