| Task Name | Dataset Name | Metric | SOTA Result | Trend |
|---|---|---|---|---|
| Dialogue Generation | PersonaChat (test) | Persona Consistency | 2.31 | 27 |
| Turn-level correlation with human Overall Quality ratings | PersonaChat turn-level | Spearman Correlation | 0.4814 | 20 |
| Personalized Dialogue Generation | PersonaChat (Human Evaluation) | Fluency | 3.58 | 16 |
| Persona Adherence Alignment | PersonaChat 1.0 (test) | Similarity | 100 | 11 |
| Persona Simulation Naturalness Evaluation | PersonaChat (test) | CS (Coherence Score) | 0.718 | 11 |
| Authorship Verification | PersonaChat original (test) | F1 Score | 67.1 | 11 |
| Persona Simulation | PersonaChat | Adherence Score | 100 | 11 |
| Dialogue Policy Evaluation | PersonaChat (test) | USR RET | 97.7 | 10 |
| Social Inclusion | PersonaChat | Diversity | 41.52 | 9 |
| Dialogue Generation | PersonaChat | BLEU-1 | 19.05 | 8 |
| Persona-based Dialogue Generation | PersonaChat (full) | Perplexity | 7.8 | 6 |
| Dialogue Coherence | PersonaChat | QuantiDCE | 3.03 | 5 |
| Machine Unlearning | PersonaChat (Forget Set) | PDLP | 100 | 4 |
| Machine Unlearning | PersonaChat (test) | PDLP | 80 | 4 |
| Dialogue Evaluation | PersonaChat | USR RET | 97.7 | 4 |
| Response Selection | PersonaChat (test) | R@1 (R20 Context) | 86.9 | 3 |
| Dialogue Generation | PersonaChat | - | - | 0 |