| Douban Conversation Corpus (test) | BERT_TL | MAP 0.675 | | 94 | 4d ago |
| E-commerce (test) | BERT_TL | Recall@1 (R10) 0.927 | | 81 | 4d ago |
| Ubuntu (test) | BERT-UMS+FGC | Recall@1 (Top 10) 0.886 | | 58 | 4d ago |
| DSTC7 Track 1 (test) | Cross-encoder | Recall@1 (Top 100) 91.1 | | 27 | 4d ago |
| ConvAI2 (dev) | Cross-encoder | R@1/20 90.3 | | 25 | 4d ago |
| Ubuntu v2 (test) | Cross-encoder | MRR 91.9 | | 20 | 4d ago |
| MWOZ 2.1 | FutureTOD | Accuracy (1/100) 68.5 | | 17 | 3d ago |
| P-Soups Expertise | Qwen3-32B thinking | Accuracy 83.66 | | 16 | 4d ago |
| P-Soups Style | Qwen3-32B thinking | Accuracy 0.88 | | 16 | 4d ago |
| P-Soups Informativeness | ALIGNXPLORE+ | Accuracy 78.07 | | 16 | 4d ago |
| PersonaMem | TALLRec | Accuracy 64.36 | | 16 | 4d ago |
| AlignX | ALIGNXPLORE+ | Accuracy 75.03 | | 16 | 4d ago |
| ConvAI2 (test) | Cross-encoder | R@20 87.9 | | 16 | 4d ago |
| PERSONA-CHAT Revised (test) | P5 | R@1 82.79 | | 11 | 4d ago |
| PERSONA-CHAT Original Persona (test) | P5 | R@1 87.45 | | 11 | 4d ago |
| Reddit SC (test) | | Perplexity@Top-1 181.8 | | 11 | 4d ago |
| Reddit MC (test) | CFC-QS | Perplexity@1 194.8 | | 11 | 4d ago |
| Ubuntu IRC Len-15 (test) | MPC-BERT | R@2 89.7 | | 10 | 4d ago |
| Ubuntu IRC Len-10 (test) | MPC-BERT | R@2 89.14 | | 10 | 4d ago |
| Ubuntu IRC Len-5 (test) | MPC-BERT | Recall@2 87.63 | | 10 | 4d ago |
| Ubuntu IRC (test) | MPC-BERT | R2@1 94.9 | | 8 | 4d ago |
| Reddit (test) | ConveRT | R@1 (R100) 71.8 | | 7 | 4d ago |
| AmazonQA (test) | ConveRT | R@1 (K=100) 84.3 | | 6 | 4d ago |
| Focus 1.0 (val) | P5 | R@1 97.85 | | 3 | 4d ago |
| PersonaChat (test) | Uni-Encoder | R@1 (R20 Context) 86.9 | | 3 | 4d ago |