Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

WOZ

Benchmarks

Task NameDataset NameSOTA ResultTrend
Dialogue State TrackingWOZ 2.0 (test)
Joint Goal Accuracy91.37
65
Dialogue State TrackingWOZ 2.0
Joint GA91.2
7
Dialog response generationFEWSHOTWOZ (test)
Informativeness2.92
4
Knowledge RetrievalWoz 2.1 (test)
Joint Accuracy0.8024
3
Dialog Response GenerationFEWSHOTWOZ Taxi 1.0 (test)
BLEU19.7
3
Dialog Response GenerationFEWSHOTWOZ train 1.0 (test)
BLEU17.21
3
Dialog Response GenerationFEWSHOTWOZ Attraction 1.0 (test)
BLEU20.69
3
Dialog Response GenerationFEWSHOTWOZ Restaurant 1.0 (test)
BLEU38.08
3
Goal-oriented dialogueWOZ (test)
dsEM84.9
2
Dialog State TrackingWOZ Italian (IT) (test)
JGA71.4
2
Showing 10 of 10 rows