Dialogue Evaluation on Wizard of Internet (WoI)
[Figure: Perplexity over time. Plotted point: OPT-175B, perplexity 12, May 2, 2022. Metrics: Perplexity, Unigram F1.]
Updated 4d ago
Evaluation Results
| Method | Details | Date | Perplexity | Unigram F1 |
| --- | --- | --- | --- | --- |
| OPT-175B | Evaluation Mode=Unsupe... | 2022.05 | 12 | 0.147 |
| R2C2 BlenderBot | Evaluation Mode=Superv... | 2022.05 | 14.6 | 0.16 |
| BlenderBot 1 | Evaluation Mode=Superv... | 2022.05 | 14.7 | 0.154 |
| Reddit 2.7B | Evaluation Mode=Unsupe... | 2022.05 | 18 | 0.124 |