| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Dialogue Generation | PersonaChat (test) | Persona Consistency2.31 | 27 | |
| Turn-level correlation with human Overall Quality ratings | PersonaChat turn-level | Spearman Correlation0.4814 | 20 | |
| Personalized Dialogue Generation | PersonaChat (Human Evaluation) | Fluency3.58 | 16 | |
| Dialogue Policy Evaluation | PersonaChat (test) | USR RET97.7 | 10 | |
| Dialogue Generation | PersonaChat | BLEU-119.05 | 8 | |
| Persona-based Dialogue Generation | PersonaChat (full) | Perplexity7.8 | 6 | |
| Dialogue Coherence | PersonaChat | QuantiDCE3.03 | 5 | |
| Machine Unlearning | PersonaChat (Forget Set) | PDLP100 | 4 | |
| Machine Unlearning | PersonaChat (test) | PDLP80 | 4 | |
| Dialogue Evaluation | PersonaChat | USR RET97.7 | 4 | |
| Response Selection | PersonaChat (test) | R@1 (R20 Context)86.9 | 3 | |
| Dialogue Generation | PersonaChat | Metric- | 0 |