Multi-turn Dialogue Generation on Chinese Open-domain Conversation 100-sample (test)
[Chart: Coherence over time. Plotted entry: PLATO-XL (Diamante), Aug 30, 2022. Metrics: Coherence, Informativeness, Safety, Engagingness.]
Updated 3mo ago
Evaluation Results
| Method | Date | Coherence | Informativeness | Safety | Engagingness |
| --- | --- | --- | --- | --- | --- |
| PLATO-XL (Diamante) | 2022.08 | 1.9 | 1.91 | 1.96 | 1.93 |
| Human Reference | 2022.08 | 1.88 | 1.87 | 1.92 | 1.83 |
| PLATO-XL | 2022.08 | 1.73 | 1.61 | 1.87 | 1.56 |