Our new X account is live! Follow @wizwand_team for updates
SOTA Dialogue Response Generation benchmarks and papers with code | Wizwand
Dialogue Response Generation
Benchmarks
Dataset Name | SOTA Method | Metric | Value | Results | Last Updated
Persona-Chat | JGR-R | BLEU-1 | 53.3 | 20 | 4d ago
Topical-Chat Global | GPT-4o mini | Und | 98.5 | 16 | 3d ago
KEEM (KMSC memories) 1.0 (test) | Llama10.8B tuned Korean | Perplexity | 5.47 | 14 | 3d ago
ConvAI2 | MoCoRP LLM | F1 | 22.89 | 12 | 3d ago
bAbI Dialogue Task 5 OOV | GLMP | Per-response Accuracy | 92 | 9 | 3d ago
bAbI Dialogue Task 4 OOV | Ptr-Unk | Per-response Accuracy | 100 | 9 | 3d ago
bAbI Dialogue Task 3 OOV | GLMP | Accuracy (Per Response) | 96.7 | 9 | 4d ago
bAbI Dialogue Task 2 OOV | GLMP | Accuracy (Per-response) | 100 | 9 | 4d ago
bAbI Dialogue Task 1 OOV | GLMP | Per-response Accuracy | 1 | 9 | 3d ago
bAbI Dialogue Task 5 | QRN | Per-response Accuracy | 99.6 | 9 | 3d ago
bAbI Dialogue Task 4 | Ptr-Unk | Per-response Accuracy | 100 | 9 | 3d ago
bAbI Dialogue Task 3 | GLMP | Accuracy (Per-response) | 96.3 | 9 | 4d ago
bAbI Dialogue Task 2 | MN | Per-response accuracy | 100 | 9 | 3d ago
bAbI Dialogue Task 1 | GMN | Per-response Accuracy | 100 | 9 | 3d ago
KEEM memories 1.0 (test) | Llama10.8B tuned Korean | Perplexity | 4.56 | 7 | 3d ago
Dialogue Dataset (test) | Sampling | Adversarial Success | 37.2 | 7 | 4d ago
GROWOVER-DIALOGUE (ALL) | RiLM | BLEU Score (Month 9) | 4.7 | 6 | 2d ago
GROWOVER-DIALOGUE (CHANGED) | RiLM | BLEU (Month 9) | 7.26 | 6 | 2d ago
100 randomly sampled conversational pairs (test) | SaBART | Appropriateness | 66.1 | 6 | 4d ago
In-car dialogue dataset (test) | S2S+Intent+JE+EL+KVL | BLEU | 18.31 | 6 | 4d ago
Ubuntu Dialogue Corpus | MrRNN Act.-Ent. | Activity Precision | 16.84 | 6 | 2d ago
GROWOVER-DIALOGUE UNCHANGED | RiLM | Score at Month 9 | 2.66 | 5 | 2d ago
GROWOVER-DIALOGUE (NEW) | RiLM | Metric Value (Month 9) | 3.61 | 5 | 2d ago
PhotoChat (test) | Divter | Kappa | 0.68 | 5 | 3d ago
SMCalFlow2Text sampled v2.0 (val) | QCFG-Constrained Decoding | Grammaticality | 99 | 4 | 3d ago
Showing 25 of 35 rows