Response Generation on Human Evaluation multi-turn (200 items) (test)
[Chart: Adversarial Win Rate by date. Adver-REGS: 72 (Jan 23, 2017). Other plotted metrics: Adversarial Loss Rate, Tie Rate.]
Updated 1mo ago
Evaluation Results

Method                                            Adversarial Win Rate (%)   Adversarial Loss Rate (%)   Tie Rate (%)
Adver-REGS (Comparison Baseline=Mu..., 2017.01)   72                         10                          18