| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| MultiWOZ 2.1 (test) | ESAinsTOD (GT) | Joint Goal Accuracy60.76 | 105 | 1mo ago | |
| MultiWOZ 2.2 (test) | LUAS_R+G | Joint Goal Accuracy66.25 | 98 | 27d ago | |
| WOZ 2.0 (test) | AG-DST | Joint Goal Accuracy91.37 | 65 | 3mo ago | |
| MultiWOZ 2.1 | ReacTOD | Joint Goal Accuracy61.29 | 46 | 14d ago | |
| MultiWOZ 2.4 (test) | LUAS_R+G | Joint Goal Acc78.2 | 45 | 3mo ago | |
| DSTC2 (test) | Seq2seq-DU | Joint Goal Accuracy85 | 39 | 3mo ago | |
| MultiWOZ 2.0 (test) | ESAinsTOD (GT) | Joint Goal Accuracy57.23 | 29 | 2mo ago | |
| SGD | HiCoLoRA | JGA (Overall)55.01 | 24 | 1mo ago | |
| Public (test) | + Stream-Only | Joint Goal Accuracy (JGA)97.98 | 15 | 8d ago | |
| MultiWOZ zero-shot 2.1 | HiCoLoRA | Attraction Accuracy38.86 | 11 | 1mo ago | |
| MultiWOZ 2.3 (test) | TripPy | JGA63 | 11 | 3mo ago | |
| MultiWOZ 2.1 (5%) | SVAG + EDZ-DA | Joint Goal Acc44.98 | 11 | 3mo ago | |
| ToDs benchmark GPT-2 backbone (test) | JGA50.03 | 11 | 3mo ago | ||
| SGD (test) | paDST | JGA86.5 | 11 | 3mo ago | |
| MultiWOZ 2.1 (1%) | SVAG + EDZ-DA | Joint Goal Acc37.26 | 10 | 3mo ago | |
| SGD (train) | HiCoLoRA | JGA55.99 | 9 | 1mo ago | |
| SGD Messaging | HiCoLoRA | JGA67.79 | 9 | 1mo ago | |
| MultiWOZ 2.3 | DKF-DST | Joint Goal Accuracy63.1 | 9 | 2mo ago | |
| MultiWOZ 2.2 | DKF-DST | Joint GA62.3 | 9 | 2mo ago | |
| SGD | ReacTOD | Joint GA80.68 | 9 | 14d ago | |
| MultiWOZ 2.4 | DKF-DST | Joint Goal Accuracy77.3 | 8 | 2mo ago | |
| SGD Media | HiCoLoRA | JGA76.2 | 7 | 1mo ago | |
| Sim-R (test) | DiCoS-DST | Joint Goal Accuracy91.5 | 7 | 3mo ago | |
| Sim-M (test) | DiCoS-DST | Joint Goal Accuracy84.7 | 7 | 3mo ago | |
| WOZ 2.0 | Seq2Seq-DU-w/oSchema | Joint GA91.2 | 7 | 3mo ago |