Dialogue Natural Language Inference
About
Consistency is a long standing issue faced by dialogue models. In this paper, we frame the consistency of dialogue agents as natural language inference (NLI) and create a new natural language inference dataset called Dialogue NLI. We propose a method which demonstrates that a model trained on Dialogue NLI can be used to improve the consistency of a dialogue model, and evaluate the method with human evaluation and with automatic metrics on a suite of evaluation sets designed to measure a dialogue model's consistency.
Sean Welleck, Jason Weston, Arthur Szlam, Kyunghyun Cho• 2018
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Claim Verification | DIALFACT (test) | Accuracy43.3 | 18 | |
| Claim Verification | DIALFACT (val) | Accuracy42 | 18 | |
| Verifiable Claim Detection | DIALFACT (test) | Accuracy82.1 | 4 |
Showing 3 of 3 rows