Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Dialogue Evaluation on Blended Skill Talk (BST)
Loading...
11.7
Perplexity
R2C2 BlenderBot
11.472
13.011
14.55
16.089
May 2, 2022
Perplexity
Unigram F1
Updated 4d ago
Evaluation Results
Method
Method
Links
Perplexity
Unigram F1
R2C2 BlenderBot
Evaluation Mode=Superv...
2022.05
11.7
18.6
BlenderBot 1
Evaluation Mode=Superv...
2022.05
11.9
17.8
OPT-175B
Evaluation Mode=Unsupe...
2022.05
12.1
16.2
Reddit 2.7B
Evaluation Mode=Unsupe...
2022.05
17.4
13.3
Feedback
Search any
task
Search any
task