| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| PRIST 1.0 (test) | OMG-LLaVA | BLEU-40.1121 | 13 | 4d ago | |
| QReCC (test) | ChatRetriever + Mistral | F1 Score26.3 | 10 | 4d ago | |
| TopiOCQA (test) | UniConv | F1 Score0.296 | 10 | 4d ago | |
| Reddit (test) | Dist-10.947 | 9 | 4d ago | ||
| INSCIT (test) | UniConv | F1 Score33.2 | 9 | 4d ago | |
| OR-QUAC (test) | F1 Score17.8 | 9 | 4d ago | ||
| ReDial (test) | CR-Walker | Fluency2.6 | 7 | 4d ago | |
| ReDial | STARCRS | Fluency82 | 6 | 4d ago | |
| Bitext Retail Banking LLM Chatbot (test) | SFT Model | BLEU26.85 | 5 | 4d ago | |
| Cornell Movie Dialog 110K Data | PALM | Perplexity21.98 | 4 | 4d ago | |
| Cornell Movie Dialog 10K Data | PALM | Perplexity45.43 | 4 | 4d ago |