Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Dialogue Evaluation on Human/Model Chats (test)
Loading...
83
Engagement Score
MMB Style
14.36
32.18
50
67.82
Oct 2, 2020
Engagement Score
Human Preference Score
Updated 3d ago
Evaluation Results
Method
Method
Links
Engagement Score
Human Preference Score
MMB Style
Comparison Baseline=Di...
2020.10
83
67
MMB Style
Comparison Baseline=Di...
2020.10
71
60
MMB Style
Comparison Baseline=Meena
2020.10
63
64
Meena
Baseline vs MMB=Baseline
2020.10
37
36
DialoGPT
Generation Parameters=...
2020.10
29
40
DialoGPT
Generation Parameters=...
2020.10
17
33
Feedback
Search any
task
Search any
task