Knowledge-grounded Dialog Generation on WoW (Seen)
[Chart: Appropriateness Score over time. Best result: Human Response, 4.5 (May 4, 2022). Available metrics: Appropriateness Score, Informativeness Score.]
Updated 1mo ago
Evaluation Results
| Method | Date | Appropriateness Score | Informativeness Score |
|---|---|---|---|
| Human Response (Evaluation Set=DiffKS...) | 2022.05 | 4.5 | 4.3 |
| Human Response (Evaluation Set=Base vs KI) | 2022.05 | 4.4 | 4.3 |
| DiffKS+KI | 2022.05 | 3.9 | 4.0 |
| Transformer+KI | 2022.05 | 3.7 | 3.5 |
| DiffKS | 2022.05 | 3.6 | 3.6 |
| Transformer | 2022.05 | 3.0 | 3.2 |