Persona-Chat

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Dialogue Response Generation | Persona-Chat | BLEU-1 53.3 | 20 |
| Next Utterance Prediction | Persona-Chat (val) | Accuracy 77.01 | 13 |
| Dialogue Generation | PERSONA-CHAT Original (dev) | Hits@1 89.5 | 13 |
| Response Selection | PERSONA-CHAT Revised (test) | R@1 82.79 | 11 |
| Response Selection | PERSONA-CHAT Original Persona (test) | R@1 87.45 | 11 |
| Dialogue Generation | PERSONA-CHAT Revised (dev) | Hits@1 85 | 11 |
| Human Evaluation of Dialogue | Persona-Chat 1.0 (test) | Fluency 4.31 | 9 |
| Profile Prediction | Persona-Chat | Error Rate (Profile) 1.1 | 8 |
| Smart Reply | PERSONA-CHAT (test) | ROUGE Score 7.71 | 7 |
| Dialog utterance prediction | PERSONA-CHAT Revised v1 | Hits@1 0.354 | 6 |
| Dialog utterance prediction | PERSONA-CHAT Original v1 | Hits@1 51.1 | 6 |
| Dialog utterance prediction | PERSONA-CHAT No Persona v1 | Hits@1 0.349 | 6 |
| Dialogue Modeling | PERSONA-CHAT (val) | Hits@1 82.1 | 5 |
| Dialogue Modeling | PERSONA-CHAT (test) | F1 19.5 | 4 |
| Turn-level dialogue quality evaluation (Uses Knowledge) | Persona-Chat turn-level (test) | Spearman Correlation 0.6309 | 3 |
| Turn-level dialogue quality evaluation (Interesting) | Persona-Chat turn-level (test) | Spearman Correlation 0.2634 | 3 |
| Turn-level dialogue quality evaluation (Maintains Context) | Persona-Chat turn-level (test) | Spearman Corr (Context) 0.5625 | 3 |
| Turn-level dialogue quality evaluation (Understandable) | Persona-Chat turn-level (test) | Spearman Correlation (Understandable) 0.1324 | 3 |
| Persona Perception | PERSONA-CHAT synthesized Revised (test) | Hits@1 78.2 | 3 |
| Persona Perception | PERSONA-CHAT synthesized Original (test) | Hits@1 93.8 | 3 |
| Dialogue Generation | PERSONA-CHAT original (dev) | Category 1 Score 41.7 | 3 |