Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Dialogue Evaluation on Wizard of Wikipedia (WW)
Loading...
12.4
Perplexity
R2C2 BlenderBot
12.056
14.378
16.7
19.022
May 2, 2022
Perplexity
Unigram F1
Updated 4d ago
Evaluation Results
Method
Method
Links
Perplexity
Unigram F1
R2C2 BlenderBot
Evaluation Mode=Superv...
2022.05
12.4
0.198
BlenderBot 1
Evaluation Mode=Superv...
2022.05
12.5
0.189
OPT-175B
Evaluation Mode=Unsupe...
2022.05
13.3
0.152
Reddit 2.7B
Evaluation Mode=Unsupe...
2022.05
21
0.133
Feedback
Search any
task
Search any
task