Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Task-oriented dialogue on In-car personal assistant dialogue dataset (test)
Loading...
3.96
Correctness
Rule-based
3.5024
3.6212
3.74
3.8588
May 15, 2017
Correctness
Appropriateness
Humanlikeness
Updated 4d ago
Evaluation Results
Method
Method
Links
Correctness
Appropriateness
Humanlikeness
Rule-based
2017.05
3.96
3.57
3.28
KV Ret. Net
2017.05
3.7
3.64
3.5
Copy Net
2017.05
3.52
3.63
3.56
Feedback
Search any
task
Search any
task