| Task Name | Dataset Name | Metric | SOTA Result | Trend |
|---|---|---|---|---|
| Dialog State Tracking | MultiWOZ 2.1 (test) | Joint Goal Accuracy | 79.2 | 88 |
| Dialogue State Tracking | MultiWOZ 2.1 (test) | Joint Goal Accuracy | 60.61 | 85 |
| Dialogue State Tracking | MultiWOZ 2.2 (test) | Joint Goal Accuracy | 66.25 | 80 |
| End-to-end task-oriented dialogue | MultiWOZ (test) | Task Success Rate | 94 | 68 |
| End-to-End Task-Oriented Dialogue | MultiWOZ 2.1 (test) | BLEU Score | 22.19 | 49 |
| Dialog State Tracking | MultiWOZ 2.0 (test) | Joint Goal Accuracy | 55.03 | 47 |
| Dialogue State Tracking | MultiWOZ 2.4 (test) | Joint Goal Acc | 78.2 | 45 |
| Task-Oriented Dialogue | MultiWOZ 2.0 (test) | Inform Rate | 99.1 | 37 |
| Response Generation | MultiWOZ (test) | BLEU Score | 35.1 | 27 |
| Dialogue State Tracking | MultiWOZ 2.1 | Joint Goal Accuracy | 60.61 | 26 |
| Task-Oriented Dialogue | MultiWOZ 2.2 (test) | Inform Rate | 96.48 | 23 |
| End-to-end Task-oriented Dialogue | MultiWOZ 2.0 (test) | Inform Accuracy | 97.5 | 22 |
| End-to-end Dialogue Modelling | MultiWOZ 2.0 (test) | Inform Rate | 95.4 | 22 |
| Knowledge-grounded Dialog Generation | MultiWOZ | Win Rate | 98.7 | 20 |
| Spoken Dialogue State Tracking | MultiWOZ (test) | Joint Goal Acc | 32.4 | 17 |
| Task-Oriented Dialogue | MultiWOZ 2.4 (test) | JGA | 43.8 | 15 |
| Dialogue State Tracking | MultiWOZ 2.0 (test) | Joint Goal Accuracy | 55.48 | 13 |
| Task-oriented dialogue | MultiWOZ 2.0 | Inform Rate | 91.8 | 13 |
| Task-Focused Dialogue | Multiwoz | TSE Score | 0.7699 | 11 |
| Task-Oriented Dialogue | MultiWOZ 2.1 (test) | Inform Rate | 99.62 | 11 |
| Dialogue State Tracking | MultiWOZ 2.3 (test) | JGA | 63 | 11 |
| Dialogue State Tracking | MultiWOZ 2.1 (5%) | Joint Goal Acc | 44.98 | 11 |
| Dialogue State Tracking | MultiWOZ 2.1 (1%) | Joint Goal Acc | 37.26 | 10 |
| Natural Language Generation | MultiWOZ 2.2 (test) | Inform | 90 | 10 |
| Task-Oriented Dialogue | MultiWOZ 20% 2.0 (train) | Inform | 90.25 | 10 |