Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Open-domain Conversation on Chinese open-domain conversation Self-chat (test)
Loading...
194.8
Coherence
PLATO-XL (Diamante)
42.544
82.072
121.6
161.128
Aug 30, 2022
Coherence
Informativeness
Safety
Engagingness
Updated 3mo ago
Evaluation Results
Method
Method
Links
Coherence
Informativeness
Safety
Engagingness
PLATO-XL (Diamante)
2022.08
194.8
192
198.8
186
PLATO-XL
2022.08
178.8
162.4
178.8
124
EVA 2.0
2022.08
150.8
135.2
176.4
96
CDial-GPT
2022.08
48.4
40
66
14
Feedback
Search any
task
Search any
task