Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Dialog Generation on CRD (test)
Loading...
4.73
Appropriateness
Human Response
3.6068
3.8984
4.19
4.4816
May 4, 2022
Appropriateness
Informativeness
Updated 1mo ago
Evaluation Results
Method
Method
Links
Appropriateness
Informativeness
Human Response
2022.05
4.73
4.21
Transformer+KI
2022.05
4.22
3.51
Transformer
2022.05
3.65
3.15
Feedback
Search any
task
Search any
task