Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Dialogue Evaluation on Wizard of Wikipedia (WW)
Loading...
12.4
Perplexity
R2C2 BlenderBot
12.056
14.378
16.7
19.022
May 2, 2022
Perplexity
Unigram F1
Updated 1mo ago
Evaluation Results
Method
Method
Links
Perplexity
Unigram F1
R2C2 BlenderBot
Evaluation Mode=Superv...
2022.05
12.4
0.198
BlenderBot 1
Evaluation Mode=Superv...
2022.05
12.5
0.189
OPT-175B
Evaluation Mode=Unsupe...
2022.05
13.3
0.152
Reddit 2.7B
Evaluation Mode=Unsupe...
2022.05
21
0.133
Feedback
Search any
task
Search any
task