Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

MultiWOZ

Benchmarks

Task NameDataset NameSOTA ResultTrend
Dialog State TrackingMultiWOZ 2.1 (test)
Joint Goal Accuracy79.2
88
Dialogue State TrackingMultiWOZ 2.1 (test)
Joint Goal Accuracy60.61
85
Dialogue State TrackingMultiWOZ 2.2 (test)
Joint Goal Accuracy66.25
80
End-to-end task-oriented dialogueMultiWOZ (test)
Task Success Rate94
68
End-to-End Task-Oriented DialogueMultiWOZ 2.1 (test)
BLEU Score22.19
49
Dialog State TrackingMultiWOZ 2.0 (test)
Joint Goal Accuracy55.03
47
Dialogue State TrackingMultiWOZ 2.4 (test)
Joint Goal Acc78.2
45
Task-Oriented DialogueMultiWOZ 2.0 (test)
Inform Rate99.1
37
Response GenerationMultiWOZ (test)
BLEU Score35.1
27
Dialogue State TrackingMultiWOZ 2.1
Joint Goal Accuracy60.61
26
Task-Oriented DialogueMultiWOZ 2.2 (test)
Inform Rate96.48
23
End-to-end Task-oriented DialogueMultiWOZ 2.0 (test)
Inform Accuracy97.5
22
End-to-end Dialogue ModellingMultiWOZ 2.0 (test)
Inform Rate95.4
22
Knowledge-grounded Dialog GenerationMultiWOZ
Win Rate98.7
20
Spoken Dialogue State TrackingMultiWOZ (test)
Joint Goal Acc32.4
17
Task-Oriented DialogueMultiWOZ 2.4 (test)
JGA43.8
15
Dialogue State TrackingMultiWOZ 2.0 (test)
Joint Goal Accuracy55.48
13
Task-oriented dialogueMultiWOZ 2.0
Inform Rate91.8
13
Task-Focused DialogueMultiwoz
TSE Score0.7699
11
Task-Oriented DialogueMultiWOZ 2.1 (test)
Inform Rate99.62
11
Dialogue State TrackingMultiWOZ 2.3 (test)
JGA63
11
Dialogue State TrackingMultiWOZ 2.1 (5%)
Joint Goal Acc44.98
11
Dialogue State TrackingMultiWOZ 2.1 (1%)
Joint Goal Acc37.26
10
Natural Language GenerationMultiWOZ 2.2 (test)
Inform90
10
Task-Oriented DialogueMultiWOZ 20% 2.0 (train)
Inform90.25
10
Showing 25 of 60 rows