| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Knowledge-grounded Dialogue | WoW | F1 Score17.35 | 15 | |
| Retrieval-Augmented Generation | WoW | LLM Score88.87 | 11 | |
| Dialogue | WoW | F1 Score14.77 | 8 | |
| Knowledge-grounded Dialog Generation | WoW (Seen) | Appropriateness Score4.5 | 6 | |
| Natural Language Generation | WoW | Mean Relevance4.68 | 5 | |
| Knowledge-grounded Dialogue | WoW (test) | Dialogue Turns163 | 2 |