Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Dialogue Evaluation on ConvAI2 (C2)
Loading...
10.2
Perplexity
BlenderBot 1
9.852
12.201
14.55
16.899
May 2, 2022
Perplexity
Unigram F1
Updated 4d ago
Evaluation Results
Method
Method
Links
Perplexity
Unigram F1
BlenderBot 1
Evaluation Mode=Superv...
2022.05
10.2
0.183
R2C2 BlenderBot
Evaluation Mode=Superv...
2022.05
10.5
0.205
OPT-175B
Evaluation Mode=Unsupe...
2022.05
10.8
0.185
Reddit 2.7B
Evaluation Mode=Unsupe...
2022.05
18.9
0.126
Feedback
Search any
task
Search any
task