Dialogue Evaluation on Blended Skill Talk (BST)
[Chart: Perplexity on BST over time (axis ticks 11.472 to 16.089). Best result: R2C2 BlenderBot, Perplexity 11.7, May 2, 2022. Selectable metrics: Perplexity, Unigram F1.]
Updated 1mo ago
Evaluation Results
| Method | Evaluation Mode | Date | Perplexity | Unigram F1 |
|---|---|---|---|---|
| R2C2 BlenderBot | Superv... | 2022.05 | 11.7 | 18.6 |
| BlenderBot 1 | Superv... | 2022.05 | 11.9 | 17.8 |
| OPT-175B | Unsupe... | 2022.05 | 12.1 | 16.2 |
| Reddit 2.7B | Unsupe... | 2022.05 | 17.4 | 13.3 |